(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 




Mil III II II II 1 1 1 II II I 1 1 1 II 1 1 11 [ I II I II! 



(43) International Publication Date (10) International Publication Number 

31 January 2002 (31.01.2002) PCT WO 02/08404 A2 



(51) International Patent Classification 7 : C12N 9/00 

(21) International Application Number: PCT/US0 1/241 1 1 

(22) International Filing Date: 26 July 2001 (26.07.2001) 

(25) Filing Language: English 

(26) Publication Language: English 

(30) Priority Data: 

09/626,127 26 July 2000 (26.07.2000) US 

(71) Applicant (for all designated States except US): LARGE 
SCALE BIOLOGY CORPORATION [US/US]; 3333 
Vaca Valley Parkway, Vacaville, CA 95688 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): GARGER, Stephen, 

J. rUS/US]; 593 Cottonwood Street, Vacaville, CA 95688 
(US). TURPEN, Thomas, H. [US/US]; 160 Santa Fe 
Court, Vacaville, CA 95688 (US). KUMAGAI, Monto, 
H. [US/US]; 809 Plum Lane, Davis, CA 95616 (US). 



(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, 
I IR, IIU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, 
LR, LS, LT, LU, LV, MA, MD, MG, MX, MN, MW, MX, 
MZ, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, 
TJ, TM, TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GQ, GW, ML, MR, NE, SN, TD, 
TG). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



(74) Agent: HALLUIN, Albert, P.; Howrey Simon Arnold & 
White LLP, Box 34, 301 Ravens wood Avenue, Menlo Park, 
CA 94025 (US). 



(54) Title: PRODUCTION OF LYSOSOMAL ENZYMES IN PLANTS BY TRANSIENT EXPRESSION 



00 



(57) Abstract: The invention relates to the production of enzymatically active recombinant human and animal lysosomal enzymes 
involving construction and expression of recombinant expression constructs comprising coding sequences of human or animal lyso- 
somal enzymes in a plant expression system. The plant expression system provides for post-translational modification and processing 
^ - to produce a recombinant gene product exhibiting enzymatic activity. The invention is demonstrated by working examples in which 
transgenic tobacco plants express recombinant expression constructs comprising human glucocerebrosidase nucleotide sequences. 
The invention is also demonstrated by working examples in which transfected tobacco plants express recombinant viral expression 
^5 constructs comprising human a glactosidase nucleotide sequences. The recombinant lysosomal enzymes produced in accordance 
with the invention may be used for a variety or purposes, including but not limited to enzyme replacement therapy for the therapeutic 
treatment of human and animal lysosomal storages diseases. 
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PRODUCTION OF LYSOSOMAL ENZYMES 
IN PLANTS BY TRANSIENT EXPRESSION 

FIELD OF THE INVENTION 
5 This invention is in the field of therapeutic peptides. Specifically this invention 

relates to the production of pharmaceutical peptides and proteins encoded on a recombinant 
plant virus or and produced by an infected plant or produced by a recombinant plant. The 
present invention relates especially to the production of human and animal lysosomal 
enzymes in plants comprising expressing the genetic coding sequence of a human or animal 

10 lysosomal enzyme in a plant expression system. The plant expression system provides for 
post-translational modification and processing to produce recombinant protein having 
enzymatic activity. The invention is demonstrated herein by working examples in which 
transgenic or transfected tobacco plants produce a modified human a galactosidase and a 
glucocerebrosidase, both of which are enzymatically active. The recombinant lysosomal 

1 5 enzymes produced in accordance with the invention may be used for a variety of purposes 
including but not limited to enzyme replacement therapy for the therapeutic treatment of 
lysosomal storage diseases, research for development of new approaches to medical treatment 
of lysosomal storage diseases, and industrial processes involving enzymatic substrate 
hydrolysis. 

20 BACKGROUND OF THE INVENTION 

Lysosomes, which are present in all animal cells, are acidic cytoplasmic organelles 
that contain an assortment of hydrolytic enzymes. These enzymes function in the degradation 
of internalized and endogenous macromolecular substrates. When there is a lysosomal 
enzyme deficiency, the deficient enzyme's undegraded substrates gradually accumulate within 

25 the lysosomes causing a progressive increase in the size and number of these organelles 

within the cell. This accumulation within the cell eventually leads to malfunction of the organ 
and to the gross pathology of a lysosomal storage disease, with the particular disease 
depending on the particular enzyme deficiency. More than thirty distinct, inherited lysosomal 
storage diseases have been characterized in humans. 

30 ENZYME REPLACEMENT THERAPY 

One proven treatment for lysosomal storage diseases is enzyme replacement therapy 
in which an active form of the enzyme is administered directly to the patient. However, 
abundant, inexpensive and safe supplies of therapeutic lysosomal enzymes are not 
commercially available for the treatment of any of the lysosomal storage diseases. There is a 

35 large number of metabolic storage disorders known to affect man. As a group, these diseases 
are the most prevalent genetic abnormalities of humans, yet individually they are quite rare. 
One of the three major classes of these conditions, comprising the majority of patients, is the 
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sphingolipidoses in which excessive quantities of undegraded fetty components of cell 
membranes accumulate because of inherited deficiencies of specific catabolic enzymes. 
Principal disorders in this category are Gaucher disease, Niemann-Pick disease, Fabry 
disease, and Tay-Sachs disease. All of these disorders are caused by harmful mutations in the 
5 genes that code for specific housekeeping enzymes within lysosomes. Thus, to be effective, 
enzyme replacement therapy requires that the requisite exogenous enzyme be taken up by the 
cells in which the materials are catabolized and that they be incorporated into lysosomes 
within these cells. Fabry disease is an ideal candidate for enzyme replacement therapy 
because the disease does not involve the central nervous system. The therapeutic enzyme 
10 . does not need to be delivered across the blood-brain barrier (1,2). 

The effectiveness of enzyme replacement therapy has been dramatically documented 
in the treatment of patients with Gaucher disease. This condition is the most frequent of all 
metabolic storage disorders. It is estimated that there are 1 5,000 patients with this condition 
in the United States and about 80,000 worldwide. Soon after the enzymatic defect in Gaucher 
15 disease was established, consideration was given to the possibility of treating patients with 
purified a -glucocerebrosidase (3). Dr. Brady elected to use human placental tissue as the 
source of enzyme in order to minimize sensitizing patients to the exogenous protein. Initial 
studies with small amounts of glucocerebrosidase injected intravenously into patients with 
Gaucher disease revealed that the exogenous enzyme reduced the quantity of accumulated 
20 glucocerebroside in the liver and in the blood (4). A large-scale enzyme purification 

procedure was developed in order to obtain sufficient quantities for clinical efficacy trials (5). 
It was then learned that modifications of the terminal sugars on oligosaccharide chains of the 
enzyme were necessary in order to target intravenously administered enzyme to macrophages 
where most of the glucocerebroside is stored. Targeting to macrophages was accomplished 
25 by sequential enzymatic removal of monosaccharide residues from glucocerebrosidase 

resulting in mannose-terminal glucocerebrosidase (6). Administration of this glycoform of 
glucocerebrosidase to patients has brought about immense improvement in their condition (7- 
10). The modified enzyme (alglucerasc) is now produced commercially by Genzyme 
Corporation in Cambridge, MA, under the trade name Ceredase™. The beneficial effects of 
30 this treatment have been universally confirmed (11-13). Production of recombinant 

glucocerebrosidase (imiglucerase) is underway in Chinese hamster ovary (CHO) cells, and the 
product (Cerezyme™) is as effective as placental glucocerebrosidase (14). The experience 
with Gaucher treatment validates enzyme replacement therapy with a product that requires 
post-translational modifications. 
35 Fabry disease is caused by deficiencies in the catalytic activity of the lysosomal 

enzyme a galactosidase A (Gal- A). ■ Human Gal-A is a glycoprotein homodimer with a 
molecular weight of approximately 101 kDa containing 5-15% Asn-linked carbohydrate. The 
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enzyme contains approximately equal portions of high mannose and complex type glycans. 
Upon isoelectric focusing, many forms of the enzyme are observed due to differences in 
sialylation depending on the source of the protein (tissue or plasma forms). The disease is 
inherited as an X-linked recessive trait. A number of specific mutations in the gene have been 
5 characterized, including partial rearrangements, splice-junction defects and point mutations. 
Most of these mutations are private and therefore, the gene appears to be highly mutable 
relative to genes encoding other housekeeping enzymes. Defects result in the accumulation of 
glycosphingolipid substrates, globotriaosylceramide and related glycolipids with terminal a - 
galactosidic linkages. Uncatabolized substrate accumulates in the plasma, vascular 
10 , endothelium and various organs leading to an early demise from vascular disease of the heart, 
brain, and kidney, particularly in the classically affected hernizygous males. In addition to 
systemic disease, affected individuals often suffer from peripheral neuropathies and have 
characteristic angiokeratoma of the skin. Heterozygous female carriers may have a more 
attenuated range of disease phenotypes (1,2). 
1 5 Exploratory trials of enzyme replacement therapy for Fabry disease have 

demonstrated the biochemical effectiveness of this approach (15-18). Repeated injections of 
purified splenic and plasma Gal-A reduced the level of plasma substrate and may have 
mobilized stored tissue substrate into circulation. No immunological complications were 
apparent in repeated infusions of enzyme into hernizygous males. Further investigations have 
~ 20 not been attempted because of the great difficulty in obtaining sufficient quantities of enzyme 
for a meaningful replacement trial. The availability of large quantities of enzyme would 
enable optimization of glycoforms for therapeutic efficacy by improving cell targeting and 
prolonging the half-life in circulation and target organs. 

a GALACTOSIDASE 

25 In the early 1 970 ! s, several investigators demonstrated the existence of two .<z.- 

Galactosidase isozymes designated A and B, which hydrolyzed the a. -galactosidic linkages in 
4-MU-and/or jho.-NP-a-D-galactopyranosides (62, 63, 64, 65, 66, 67, 68, 69) In tissues, 
about 80%-90% of total a-Galactosidase (.a.-Gal) activity was due to a thermolabile, 
myoinositol-inhibitable a-Gal A isozyme, while a relatively thermostable, . a.-Gal B, 

30 accounted for the remainder. The two "isozymes" were separable by electrophoresis, 

isoelectric focusing, and ion exchange chromatography. After neuraminidase treatment, the 
electrophoretic migrations and pi value of a-Gal A and B were similar (70), initially 
suggesting that the two enzymes were the differentially glycosylated products of the same 
gene. The rinding that the purified glycoprotein enzymes had similar physical properties 

35 including subunit molecular weight (.about.46 kDa), homodimeric structures, and amino acid 
compositions also indicated their structural relatedness (70. 71. 72. 73. 74. 75. 76. 77). 
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However, the subsequent demonstration that polyclonal antibodies against . a. -Gal A or B did 
not cross-react with the other enzyme (78, 79) that only . a. -Gal A activity was deficient in 
hemizygotes with Fabry disease (80, 81, 82, 83, 84, 85, 86) and that the genes for . a.-Gal A 
and B mapped to different chromosomes (Desnick, et al, 1989, in The Metabolic Basis of 
5 Inherited Disease, Scriver, C. R., Beaudet, A. L. Sly, W. S. and Valle, D., eds, pp. 1751-1796, 
McGraw Hill, New York; deGroot, et al., 1978, Hum. Genet. 44:305-312), clearly 
demonstrated that these enzymes were genetically distinct. 

a-GAL A AND FABRY DISEASE 

In Fabry disease, a lysosomal storage disease resulting from the deficient activity of . 

1 0 a.-Gal A, identification of the enzymatic defect in 1 967 (Brady, et al., 1 967, N. Eng. J. Med. 
276:1 163) led to the first in vitro (Dawson, et al., 1973, Pediat. Res. 7: 694-690m) and in vivo 
(Mapes, et al., 1970, Science 169:987) therapeutic trials of. a.-Gal A replacement in 1969 
and 1970, respectively. These and subsequent trials (Mapes, et al., 1970, Science 169:987; 
Desnick, et al., 1979, Proc. Natl. Acad. Sci. USA 76: 5326; and, Brady, et al., 1973, N. Engl. 

15 J. Med. 289: 9) demonstrated the biochemical effectiveness of direct enzyme replacement for 
this disease. Repeated injections of purified splenic and plasma . a.-Gal A (100,000 
U/injection) were administered to affected hemizygotes over a four month period (Desnick, et 
al., 1979, Proc. Natl. Acad. Sci. USA 76:5326). The results of these studies demonstrated that 
(a) the plasma clearance of the splenic form was 7 times faster than that of the plasma form 

20 (10 min vs 70 min); (b) compared to the splenic form of the enzyme, the plasma form effected 
a 25 -fold greater depletion of plasma substrate over a markedly longer period (48 hours vs 1 
hour); (c) there was no evidence of an immunologic response to six doses of either form, 
administered intravenously over a four month period to two affected hemizygotes; and (d) 
suggestive evidence was obtained indicating that stored tissue substrate was mobilized into 

25 the circulation following depletion by the plasma form, but not by the splenic form of the 

enzyme. Thus, the administered enzyme not only depleted the substrate from the circulation 
(a major site of accumulation), but also possibly mobilized the previously stored substrate 
from other depots into the circulation for subsequent clearance. These studies indicated the 
potential for eliminating, or significantly reducing, the pathological glycolipid storage by 

30 repeated enzyme replacement. However, the biochemical and clinical effectiveness of 

enzyme replacement in Fabry disease has not been commercially available due to the lack of 
sufficient human enzyme for adequate doses and longterm evaluation. 



35 



THE a-GAL A ENZYME 

The . a.-Gal A human enzyme has a molecular weight of approximately 101,000 Da. 
On SDS gel electrophoresis it migrates as a single band of approximately 49,000 Da 
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indicating the enzyme is a homodimer (Bishop & Desnick, 1984, J. Biol. Chem. 256: 1307). . 
ct-Gal A is synthesized as a 50,500 Da precursor containing phosphorylated endoglycosidase 
H sensitive oligosaccharides. This precursor is processed to a mature form of about 46,000 
Da within 3-7 days after its synthesis. The intermediates of this processing have not been 
5 defined (Lemansky, et al., 1987, J, Biol. Chem. 262:2062). As with many lysosomal 

enzymes, .a.-Gal A is targeted to the lysosome via the mannose-6-phosphate receptor. This is 
evidenced by the high secretion rate of this enzyme in mucolipidosis II cells and in fibroblasts 
treated with NH.sub.4 CI. 

The enzyme has been shown to contain 5-15% Asn linked carbohydrate (Ledonne, et 

10 'al., 1983, Arch. Biochem. Biophys. 224:186). The tissue form of this enzyme was shown to 
have .about.52% high mannose and 48% complex type oligosaccharides. The high mannose 
type coeluted, on Bio-gel chromatography, with Man.sub.8-9 GIcNAc while the complex type 
oligosaccharides were of two categories containing 14 and 19-39 glucose units. Upon 
isoelectric focusing many forms of this enzyme are observed depending on the sources of the 

1 5 purified enzyme (tissue vs plasma form). However, upon treatment with neuraminidase, a 
single band is observed (pl-5.1) indicating that this heterogeneity is due to different degrees 
of sialylation (Bishop & Desnick, 1981, J. Biol. Chem. 256:1307). Initial efforts to express 
the full-length cDNA encoding . a.-Gal A involved using various prokaryotic expression 
vectors (Hantzopoulos and Calhoun, 1987, Gene 57:159; Ioannou, 1990, Ph.D. Thesis, City 

20 University of New York). Although microbial expression was achieved, as evidenced by 

enzyme assays of intact E. coli cells and growth on melibiose as the carbon source, the human 
protein was expressed at low levels and could not be purified from the bacteria. These results 
indicate that the recombinant enzyme was unstable due to the lack of normal glycosylation 
and/or the presence of endogenous cytoplasmic or periplasmic proteases. 

25 GAUCHER DISEASE AND TREATMENT 

Gaucher disease is the most common lysosomal storage disease in humans, with the 
highest frequency encountered in the Ashkenazi Jewish population. About 5,000 to 10,000 
people in the United States are afflicted with this disease (Grabowski, 1993, Adv. Hum. 
Genet. 21:377-441). Gaucher disease results from a deficiency in glucocerebrosidase 

30 (hG'CB); glucosylceramidase; acid |3-glucosidase; EC 3.2.1.45). This deficiency leads to an 
accumulation of the enzyme's substrate, glucocerebroside, in reticuloendothelial cells of the 
bone marrow, spleen and liver, resulting in significant skeletal complications such as bone 
marrow expansion and bone deterioration, and also hypersplenism, hepatomegaly, 
thrombocytopenia, anemia and lung complications (Grabowski, 1993, supra; Lee, 1982, Prog. 

35 Clin. Biol. Res. 95:177-217; Brady et al., 1965, Biochem. Biophys. Res. Comm. 18:221-225). 
hGCB replacement therapy has revolutionized the medical care and management of Gaucher 
disease, leading to significant improvement in the quality of life of many Gaucher patients 
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(Pastores et al., 1993, Blood 82:408-416; Fallet et al. f 1992, Pediatr. Res. 31:496-502). 
Studies have shown that regular, intravenous administration of specifically modified hGCB 
(Ceredase.TM., Genzyme Corp.) can result in dramatic improvements and even reversals in 
the hepatic, splenic and hematologic manifestations of the disease (Pastores et al., 1993, 
5 supra; Fallet: et al., 1992, supra; Figueroa et al., 1992, N. Eng. J. Med 327:1632-1636; Barton 
etaL, 1991, N. Eng. J. Med. 324:1464-1470; Beutleret ah, 1991, Blood 78:1183-1189). 
Improvements in associated skeletal and lung complications are possible, but require larger 
doses of enzyme over longer periods of time. 

Despite the benefits of hGCB replacement therapy, the source and high cost of the 

10 .enzyme seriously restricts its availability. Until recently, the only commercial source of 

purified hGCB has been from pooled human placentae, where ten to twenty kilograms (kg) of 
placentae yield only 1 milligram (mg) of enzyme. From five hundred to two thousand 
kilograms of placenta (equivalent to 2,000-8,000 placentae) are required to treat each patient 
every two weeks. Current costs for hGCB replacement therapy range from $55 to $220/kg 

1 5 patient body weight every two weeks, or from $70,000 to $300,0007year for a 50 kg patient. 
Since the need for therapy essentially lasts for the duration of a patienfs life, costs for the 
enzyme alone may exceed $ 1 5,000,000 during 30 to 70 years of therapy. 

A second major problem associated with treating Gaucher patients with 
glucocerebrosidase isolated from human tissue (and perhaps even from other animal tissues) 

20 is the risk of exposing patients to infectious agents which may be present in the pooled 
placentae, e.g., human immuno-deficiency virus (HIV)» hepatitis viruses, and others. 

Accordingly, a new source of hGCB is needed to effectively reduce the cost of 
treatment and to eliminate the risk of exposing Gaucher patients to infectious agents. 

25 HURLER SYNDROME AND TREATMENT 

Hurler syndrome is the most common of the group of human lysosomal storage 
disorders known as the mucopolysaccharidosis (MPS) involving an inability to degrade 
dermatan sulfate and heparan sulfate. Hurler patients are deficient in the lysosomal enzyme, 
a-L-iduronidase (IDUA), and the resulting accumulation of glucosaminoglycans in the 

30 lysosomes of affected cells leads to a variety of clinical manifestations (Neufeld & Ash well, 
1980, The Biochemistry of Glycoproteins and Proteoglycans, ed. W. J. Lennarz, Plenum 
Press, N.Y.; pp. 241-266) including developmental delay, enlargement of the liver and spleen, 
skeletal abnormalities, mental retardation, coarsened facial features, corneal clouding, and 
respiratory and cardiovascular involvement. Hurler/Scheie syndrome (MPS I H/S) and Scheie 

35 syndrome (MPS IS) represent less severe forms of the disorder but also involve deficiencies 
in IDUA. Molecular studies on the genes and cDNAs of MPS I patients has led to an 
emerging understanding of genotype and clinical phenotype (Scott et al., 1990, Am. J. Hum. 
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Genet 47:802-807). In addition, both a canine and feline fornvof MPS I have been 
characterized (Haskins et al, 1979, Pediat. Res. 13: 1294-1297; Haskins and Kaklds, 1 995, 
Am. J. Hum. Genet. 57:A39 Abstr. 194; Shull et aL, 1994, Proc. Natl. Acad. Sci. USA, 
91 : 12937-12941) providing an effective in vivo model for testing therapeutic approaches. 
5 The efficacy of enzyme replacement in the canine model of Hurler syndrome using 

human IDUA generated in CHO cells was recently reported (Kakkis et al., 1995, Am. J. Hum. 
Genet 57:A39 (Abstr.); Shull et al., 1994, supra). Weekly doses of approximately 1 mg 
administered over a period of 3 months resulted in normal levels of the enzyme in liver and 
spleen, lower but significant levels in kidney and Lungs and very low levels in brain, heart, 

10 'cartilage and comea (Shull et al., 1994, supra. Tissue examinations showed normalization of 
lysosomal storage in the liver, spleen and kidney, but no improvement in heart, brain and 
corneal tissues. One dog was maintained on treatment for 13 months and was clearly more 
active with improvement in skeletal deformities, joint stiffness, corneal clouding and weight 
gain (Kakkis et al., 1995, supra. A single higher-dose experiment was quite promising and 

1 5 showed detectable IDUA activity in the brain and cartilage in addition to tissues which 

previously showed activity at the lower does. Additional higher-dose experiments and trials 
involving longer adrninistration are currently limited by availability of recombinant enzyme. 
These experiments underscore the potential of replacement therapy for Hurler patients and the 
severe constraints on both canine and human trials due to limitations in recombinant enzyme 

20 production using current technologies. 

LYSOSOMAL ENZYMES: BIOSYNTHESIS AND TARGETING 

Lysosomal enzymes are synthesized on membrane-bound polysomes in the rough 
endoplasmic reticulum. Each protein is synthesized as a larger precursor containing a 

25 hydrophobic amino terminal signal peptide. This peptide interacts with a signal recognition 
particle, an 1 IS ribonucleoprotein, and thereby initiates the vectoral transport of the nascent 
protein across the endoplasmic reticulum membrane into the lumen (Erickson, et al., 1981, J. 
Biol. Chem. 256:11224; Erickson, et al., 1983, Biochem. Biophys. Res. Commun. 115:275; 
Rosenfeld, et al., 1982, J. Cell Biol. 93:135). Lysosomal enzymes are cotranslationally 

3 0 glycosylated by the en bloc transfer of a large preformed oligosaccharide, glucose-3, 

mannose-9, N-acetylglucosamine-2, from a lipid-linked intermediate to the Asn residue of a 
consensus sequence Asn-X-Ser/Thr in the nascent polypeptide (Komfeld, R. & Kornfeld, S., 
1985, Annu. Rev. Biochem. 54:631). In the endoplasmic reticulum, the signal peptide is 
cleaved, and the processing of the Asn-linked oligosaccharide begins by the excision of three 

3 5 glucose residues and one mannose from the oligosaccharide chain. 

The proteins move via vesicular transport, to the Golgi stack where they undergo a 
variety of posttranslational modifications, and are sorted for proper targeting to specific 
destinations: lysosomes, secretion, plasma membrane. During movement through the Golgi, 
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the oligosaccharide chain on secretory and membrane glycoproteins is processed to the sialic 
acid-containing complex-type. While some of the oligosaccharide chains on lysosomal 
enzymes undergo similar processing, most undergo a different series of modifications. The 
most important modification is the acquisition of phosphomannosyl residues which serve as 
5 an essential component in the process of targeting these enzymes to the lysosome (Kaplan, et 
ah, 1977, Proc. Natl. Acad. Sci. USA 74:2026). This recognition marker is generated by the 
sequential action of two Golgi enzymes. First, N-acctylglucosammyl-phosphotransferase 
transfers N-acetylglucosamine- 1 -phosphate from the nucleotide sugar uridine diphosphate-N- 
acetyl glucosamine to selected mannose residues on lysosomal enzymes to give rise to a 

10 »phosphodiester intermediate (Reitman & Kornfeld, 1981, J. Biol. Chem. 256:4275; Waheed, 
et al., 1982, J. Biol. Chem. 257:12322). Then, N-acetylglucosamme-l-phosphodiester . ct.-N- 
acetylglucosaminidase removes N-acetylglucosamine residue to expose the recognition 
signal, mannose-6-phosphate (Varki & Kornfeld, 1981, J. Biol. Chem. 256: 9937; Waheed, et 
al., 1981, J. Biol. Chem. 256:5717). 

1 5 Following the generation of the phosphomannosyl residues, the lysosomal enzymes 

bind to mannose-6-phosphate (M-6-P) receptors in the Golgi. In this way the lysosomal 
enzymes remain intracellular and segregate from the proteins which are destined for secretion. 
The ligand-receptor complex then exits the Golgi via a coated vesicle and is delivered to a 
prelysosomal staging area where dissociation of the ligand occurs by acidification of the 

2U compartment (Gonzalez-Noriega, et al., 1980, J. Cell Biol. 85: 839). The receptor recycles 
back to the Golgi while the lysosomal enzymes are packaged into vesicles to form primary 
lysosomes. Approximately, 5-20% of the lysosomal enzymes do not traffic to the lysosomes 
and are secreted presumably, by default. A portion of these secreted enzymes may be 
recaptured by the M-6-P receptor found on the cell surface and be internalized and delivered 

25 to the lysosomes (Willingham, et al., 1981, Proc. Natl. Acad. Sci. USA 78:6967). 

Two mannose-6-phosphate receptors have been identified. A 215 kDa glycoprotein 
has been purified from a variety of tissues (Sahagian, et al., 1981, Proc. Natl. Acad. Sci. USA, 
78:4289; Steiner & Rome, 1982, Arch. Biochem. Biophys. 214:681). The binding of this 
receptor is divalent cation independent. A second M-6-P receptor also has been isolated 

30 which differs from the 215 kDa receptor in that it has a requirement for divalent cations. 

Therefore, this receptor is called the cation-dependent (M-6-P.sup.CD) while the 215 kDa one 
is called cation-independent (M-6-P.sup.CI). The M-6-P.sup.CD receptor appears to be an 
oligomer with three subunits with a subunit molecular weight of 46 kDa. 

Biosynthesis of Lysosomal Enzymes. 
35 Although many lysosomal enzymes are soluble and are transported to lysosomes by 

M-6-P receptors (MPR), integral membrane and membrane-associated proteins such as 
human glucocerebrosidase (hGCB) are targeted and transported to lysosomes independent of 
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the M-6-P/MPR system (Komfeld & Mellman, 1989, Erickson^t al., 1985). hGCB does not 
become soluble after translation, but instead becomes associated with the lysosomal 
membrane by means which have not been elucidated (von Figura & Hasilik, 1986, Annu. 
Rev. Biochem. 55:167-193; Komfeld and Mellman, 1989, Annu. Rev. Cell Biol. 5:483-525). 
5 hGCB is synthesized as a single polypeptide (58 kDa) with a signal sequence (2 kDa) at the 
amino terminus. The signal sequence is co-translationally cleaved and the enzyme is 
glycosylated with a heterogeneous group of both complex and high-mannose oligosaccharides 
to form a precursor. The glycans are predominately involved in protein conformation. The 
"high mannose" precursor, which has a molecular weight of 63 KDa, is post-translationally 

1 0 .processed in the Golgi to a 66 KDa intermediate, which is then further modified in the 

lysosome to the mature enzyme having a molecular weight of 59 KDa (Jonsson et al., 1987, 
Eur. J. Biochem. 164:171; Erickson et al., 1985, J. Biol. Chem., 260:14319). 

The mature hGCB polypeptide is composed of 497 amino acids and contains five N- 
glycosylation amino acid consensus sequences (Asn-X-Ser/Thr). Four of these sites are 

1 5 normally glycosylated. Glycosylation of the first site is essential for the production of active 
protein. Both high-mannose and complex oligosaccharide chains have been identified (Berg- 
Fussman et al., 1993, J. Biol. Chem. 268:14861-14866). hGCB from placenta contains 7% 
carbohydrate, 20% of which is of the high-mannose type (Grace & Grabowski, 1990, 
Biochem. Biophys. Res. Comm. 168:771-777). Treatment of placental hGCB with 

20 neuraminidase (yielding an asialo enzyme) results in increased clearance and uptake rates by 
rat liver cells with a concomitant increase in hepatic enzymatic activity (Furbish et al., 1981, 
Biochim. Biophys. Acta 673:425-434). This glycan-modificd placental hGCB is currently 
used as a therapeutic agent in the treatment of Gaucher's disease. Biochemical and site- 
directed mutagenesis studies have provided an initial map of regions and residues important 

25 to folding, activator interaction, and active site location (Grace et al., 1994, J. Biol. Chem. 
269:2283-2291). 

The complete complementary DNA (cDNA) sequence for hGCB has been published 
(Tsuji et al., 1986, J. Biol. Chem. 261:50-53; Sorge et al., 1985, Proc. Natl. Acad. Sci. USA 
82:7289-7293), and E. coli containing the hGCB cDNA sequence cloned from fibroblast 
30 cells, as described (Sorge et al., 1985, supra), is available from the American Type Culture 
Collection (ATCC) (Accession No. 65696). 

Recombinant methodologies have the potential to provide a safer and less expensive 
source of lysosomal enzymes for replacement therapy. However, production of active 
enzymes, e.g., hGCB, in a heterologous system requires correct targeting to the ER, and 
35 appropriate N-linked glycosylation at levels or efficiencies that avoid ER-based degradation 
or aggregation. Since mature lysosomal enzymes must be glycosylated to be active, bacterial 
systems cannot be used. For example, hGCB expressed in E. coli is enzymatically inactive 
(Grace & Grabowski, 1990, supra). 
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Active monomers of hGCB have been purified from inSect cells (Sf9 cells) and 
Chinese hamster ovary (CHO) cells infected or transfected, respectively, with hGCB cDNA 
(Grace & Grabowski, 1990, supra; Grabowski et al., 1989, Enzyme 41:131-142). A method 
for producing recombinant hGCB in CHO cell cultures and in insect cell cultures was recently 
5 disclosed in U.S. Pat. No. 5,236,838. Recombinant hGCB produced in these heterologous 
systems had an apparent molecular weight ranging from 64 to 73 kDa and contained from 5 to 
15% carbohydrate (Grace & Grabowski, 1990, supra; Grace et al., 1990, J. Biol. Chem, 
265:6827-6835). These recombinant hGCBs had kinetic properties identical to the natural 
enzyme isolated from human placentae, as based on analyses using a series of substrate and 

1 0 'transition state analogues, negatively-charged lipid activators, protein activators (saposin C), 
and mechanism-based covalent inhibitors (Grace et al., 1994, supra; Berg-Fussman et al., 
1993, supra; Grace et al., 1990, J. Biol. Chem. 265:6827-6835; Grabowski et al., 1989, supra) 
However, both insect cells and CHO cells retained most of the enzyme rather than secreting it 
into the medium, significantly increasing the difficulty and cost of harvesting the pure enzyme 

1 5 (Grabowski et al., 1989, supra). Accordingly, a recombinant system is needed that can 

produce human or animal lysosomal enzymes in an active form at lower cost, and that will be 
appropriately targeted for ease of recovery. 

ENORMOUS COSTS OF PHARMACEUTICAL ENZYME PRODUCTION 

20 While the clinical treatment of Gaucher patients provides a dramatically successful 

example of an effective therapy, the expense underscores an equally inadequate production 
technology. For example, the present cost for the first year of treatment for a severely 
affected 70 kg patient with Gaucher disease can reach $382,000. If the patient's clinical 
parameters are not restored to normal in that time, treatment at this level of expense will be 

25 prolonged before dose reduction can be initiated. Even with dose reduction, it is likely that 
the maintenance cost for such an individual will be in the range of $135,000 per year (at 
S3.70/IU). Many patients are unable to pay this large cost, and health carriers are extremely 
reluctant to underwrite this treatment for the life of these patients. Cerezyme™ is as 
expensive as Ceredase™ and at this time is available only in limited quantities. The number 

30 of patients with Gaucher disease in the US currently receiving therapy is estimated to be only 
10-15% of the total. According an article in Nature Medicine, since the introduction of this 
therapy six years ago the cost of treating Gaucher patients worldwide will soon approach one 
billion dollars (19). Although the total number of patients worldwide who would benefit from 
therapy is not known with any certainty, it is safe to assume that at least 80% of the world 

35 Gaucher population remain untreated. To quote from the NIH Technology Assessment 
Conference Summary Statement, February 27-March 1, 1995. "As a prototype for all rare 
diseases, the plight of patients with Gaucher disease raises difficult financial and ethical 
issues, which we as a society must address (20)." Fabry disease is estimated to occur at a 
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frequency of 1 in 40,000. Over 400 hemizygous male patients have been clinically described. 
It is imperative that fundamentally new methods of enzyme production be developed to 
reduce these costs so that all who suffer from these rare disorders can be treated. 

MAMMALIAN LYSOSOMES VERSUS PLANT VACUOLES 
5 Because plants are eukaryotes, plant expression systems have advantages over 

prokaryotic expression systems, particularly with respect to correct processing of eukaryotic 
gene products. However, unlike animal cells, plant cells do not possess lysosomes. Although 
the plant vacuole appears functionally analogous to the lysosome, plants do not contain MPRs 
(Chrispeels, 1991, Ann. Rev. PL Phys. PI. Mol. Biol. 42:21-53; Chrispeels andTague, 1991, 

10 Intl. Rev. Cytol. 125:1-45), and the mechanisms of vacuolar targeting can differ significantly 
from those of lysosomal targeting. For example, the predominant mechanism of vacuolar 
targeting in plants does not appear to be glycan-dependent, but appears to be based instead on 
C- or N-terminal peptide sequences (Gomez & Chrispeels, 1993, Plant Cell 5:1 1 13-1 124; 
Chrispeels & Raikhal, 1992, Cell 68:613-618; Holwerda et al., 1992, Plant Cell 4:307-318; 

15 Neuhaus et al., 1991, Proc. Natl. Acad. Sci. USA 88:10362-10366; Chrispeels, 1991, supra; 
Chrispeels & Tague, 1991, supra; Holwerda et al„ 1990, Plant Cell 2:1091-1106; Voelker et 
al., 1989, Plant Cell 1 :95-104). As a result, plants have not been viewed as appropriate 
expression systems for lysosomal enzymes which must be appropriately processed to produce 
an active product. 

20 An object of this invention is to provide the existing patient population with enough 

active enzyme to develop a lower cost treatment. The enzymatic, structural, and glycan 
compositional analyses show rGal to be active. There are recent advances in glycoprotein 
modification and drug delivery that allow, as examples, the chemical conjugation of peptides 
to carbohydrate, the covalent addition of polyethylene glycol to enzymes and the liposomal 

25 encapsulation of protein. Many additional new concepts can be tested to increase the half-life 
of enzymes in circulation and optimize cellular and subcellular targeting. Ideally, these 
modifications will require a facile and rapid genetic system to produce large quantities of 
highly pure enzyme and an effective animal disease model for drug development. Our lab- 
scale process appears highly scalable and is capable of producing grams of enzyme per month 

30 in existing indoor greenhouse growth areas. 

Using a viral transfection system and transgenic plants, we have expressed enzymes 
in plants that have potential as therapeutic agents for humans with the metabolic storage 
disorders known as Fabry disease and Gaucher disease. High specific activity recombinant 
enzymes were secreted by tobacco leaf cells via a default pathway of protein sorting into the 

35 apoplastic compartment, a network of extracellular space, cell wall matrix materials and 

intercellular fluid (IF). We further developed a novel bioprocessing method to purify these 
enzymes from the IF fraction. 
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Another object of this invention is to provide an optimized preproenzymc amino acid 
(AA) sequence for secretion of highly active lysosomal enzymes. Another object of this 
invention is to provide an optimized purification of lysosomal enzymes from either the IF 
5 fraction or from whole plant homogenates. Another object of this invention is to provide a 
molecular characterization of the enzymes purified by this process, including determination of 
enzyme specific activity. 

SUMMARY OF THE INVENTION 
The present invention relates to the production of human or animal lysosomal 
enzymes in transformed or transfected plants, plant cells or plant tissues, and involves 
constructing and expressing recombinant expression constructs comprising lysosomal enzyme 
coding sequences in a plant expression system. The plant expression system provides 
appropriate co-translational and post-translational modifications of the nascent peptide 
required for processing, e.g., signal sequence cleavage, glycosylation, and sorting of the 
expression product so that an enzymatically active protein is produced. Using the methods 
described herein, recombinant lysosomal enzymes are produced in plant expression systems 
from which the recombinant lysosomal enzymes can be isolated and used for a variety of 
purposes.. The present invention is exemplified by virally transfected and transgenic tobacco 
plants with lysosomal enzyme expression constructs. One construct comprises a nucleotide 
sequence encoding a modified human glucocerebrosidase (hGCB). Another construct 
comprises nucleotide sequence encoding a human a galactosidase (a gal or a gal A). Virally 
transfected and transgenic tobacco plants having the expression constructs produce lysosomal 
enzymes that are enzymatically active and have high specific activity. 

The plant expression systems and the recombinant lysosomal enzymes produced 
therewith have a variety of uses, including but not limited to: (1) the production of 
enzymatically active lysosomal enzymes for the treatment of lysosomal storage diseases; (2) 
the production of altered or mutated proteins, enzymatically active or otherwise, to serve as 
precursors or substrates for further in vivo or in vitro processing to a specialized industrial 
form for research or therapeutic uses, such as to produce a more effective therapeutic enzyme; 

(3) the production of antibodies against lysosomal enzymes for medical diagnostic use; and 

(4) use in any commercial process that involves substrate hydrolysis. 

BRIEF DESCRIPTION OF FIGURES 
FIGURE 1 shows a Tobamo virus expression vectors. YFG refers to any foreign 
35 gene. 

FIGURE 2 shows a Tobamovirus expression vector containing the human a 
galactosidase gene or a variant of the gene. 



15 



20 



25 



12 



WO 02/08404 



PCT7US0 1/241 11 



FIGURE 3A shows accumulation by Western Analysis of total plant soluble extract 
anti human GAL-A sera. 

FIGURE 3B shows activity of WT rGAL-A at 8 and 14+ days post inoculation of the 
plant host with a viral vector. 
5 FIGURE 4A shows Western blot analysis of total plant soluble extract anti human 

GAL-A sera 

FIGURE 4B shows activity of WT rGAL-wt and rGAL-wtR at 8 and 14+ days post 
inoculation of the plant host with a viral vector. 

FIGURE 5 shows carboxy terminal modifications to a galactosidase. 
10 • FIGURE 6 shows western blot analysis of the accumulation of 10 carboxy-modified 

rGAL-A variants from interstitial fluid and from total plant homogenate. 

FIGURE 7 shows a comparison of enzymatic activity of the 10 carboxy-modified 
rGAL-A variants. 

FIGURE 8 shows a Coomassie blue stained electrophoresis gel separation of 
1 5 carboxy-modified rGAL-A variants and controls. 

FIGURE 9 shows a Coomassie blue stained electrophoresis gel separation of 
carboxy-modified rGAL-A variants and controls. 

FIGURE 10 shows a schematic representation of rGAL-A secretion from the 
endoplasmic reticulum to the apoplast. 
20 FIGURE 1 1 shows different glycosylation structures of a galactosidase. 

FIGURE 1 2 shows TTODA (rGAL-12R) TMV RNA begins at base 1; 12671 83 
reading frame begins at 69, 34 1 7 is suppressive stop codon, and ends at 4919.30K ORF 
begins at 4903 and ends at 5709. Human a galactosidase A RNA begins at 5703, a amylase 
signal peptide is from 5762-5857; mature human a galactosidase A coding region is 5858- 
25 7036, ToMV virus coat protein and 3 UTR follows. 

FIGURE 13 shows SBS5-rGAL-12R TMV RNA begins at base 1; 126/183 reading 
frame begins at 69, 3417 is suppressive stop codon, and ends at 4919.30K ORF begins at 
4903 and ends at 5709. Human a galactosidase A RNA begins at 5703, complete (signal 
peptide and mature protein coding region) human a galactosidase A gene 5766-7037, TMV 
30 U5 virus coat protein and 3 UTR follows. 

DETAILED DESCRIPTION OF THE INVENTION 
Gal-A is one of many proteins that require glycan site occupancy at N-linked sites to 
achieve proper folding and stability. The ability to successfully target the enzyme in Fabry 
patients is also likely to be glycosylation-dependent. This requirement presently limits the 
35 expression possibilities to eukaryotie cell types. Recombinant proteins synthesized in 
baculovirus and yeast expression systems are often hyperglycosylated and highly 
heterogeneous complicating the preparation of therapeutically effective glycoforms from 
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these sources. The rGal-A synthesized in plants is a relatively homogeneous glycoform as 
analyzed by its SDS-PAGE eiectrophoretic mobility and comigrates with rGal-A produced 
purified from placenta (Fig. 2). The expression results (yield and purity) we have already 
presented are unprecedented in any eukaryote system for a glycosylated enzyme and are not 
5 likely to be achieved in the foreseeable future with transgenic plants or animals. "Crude" 
rGal-A from the leaf IF has a specific activity of over 1,000,000 U/mg of protein, whereas 
CHO, COS-1 and insect cell extracts and supematants are maximally only 10 - 20,000 U/mg; 
(36,41,42). 

Protein pharmaceuticals may vary over five orders of magnitude in unit value and be 

1 0 . required in kg/year quantities. The example of Gaucher disease emphasizes the need progress 
in production phase research. Many additional heritable metabolic disorders, particularly 
those caused by dysfunctional lysosomal enzymes, might be treated by supplementation with 
exogenously produced enzymes. Enzyme replacement using macrophage-targeted human 
glucocerebrosidase has been shown to be extraordinarily beneficial for Gaucher patients. 

1 5 However, the cost of this treatment is very great. If the significant advances in clinical 

research are to be applied on a practical scale, new production technologies will be required to 
deliver bioproducts such as these to those in need at an affordable cost (43). No savings in 
Gaucher treatment costs were realized upon introduction of the recombinant CHO-^ell 
product Cerezyme™ to replace the placental-derived Ceredase™. A significant reduction in 

20 cost requires fundamental changes in both the source of enzyme and process of purification. 

While the existing treatment for Gaucher disease is safe and effective, there are 
. potential contaminants derived from the source material that may pose serious risks. For the 

existing pharmaceutical products, these risks primarily include possibilities of contamination 
with human pathogenic viruses or peptides with potent hormonal activities such as human 

25 chorionic gonadotropin (44). These potential contaminants are not present in plant source 
material. 

The main goal in selecting plants for expression of this protein is the potential for a 
radical reduction in costs. For the RNA-viral mediated synthesis of rGal-A and rGCB in 
plants, this is very likely to be achieved through the synergistic combination of three factors: 
30 Complex crude extracts from various eukaryote cell production systems may be 

replaced with a plant leaf homogenate or IF fractions highly enriched in recombinant product 

Large-scale, sterile, cell fermentation systems and associated media, capitalization, 
and waste costs may be replaced with plant biomass. Production is then inexpensively scaled 
to the quantities desired. 

35 

• The labor and time required to generate transgenic higher plants or animals may be 
replaced with a very rapid and simple plant transformation or plant viral transfection system. 
Modern agriculture can supply a new generation of medicinal plants as a source of 



14 



WO 02/08404 



PCT/US01/24111 



pharmaceuticals — a source that should be as inexpensive and readily available as our 
food, fiber, flavors and chemical feedstocks. 

5 EXPERIMENTAL DESIGN AND METHODS 

Post-Translational Processing and Secretion. Protein secretion to the plant apoplast 
is through a default pathway. In our experience, addition of the rice a-amylase signal peptide 
(a- ASP) sequence at the N-terminus of several recombinant proteins is sufficient to direct the 
protein to the lumen of the ER in tobacco leaves. However, this is not likely to be a rate 

1 o limiting step in protein accumulation and many native signal peptides may function equally 
well in plants. It would be most desirable to include few if any additional AA residues at the 
N terminus after processing. For this reason, we compared expression from the native signal 
peptide encoded in human Gal-A cDNA to that from the foreign a-ASP sequence. 

Mutations m the carboxy-terrninal domain of the Gal-A homodimer have profound 

1 5 effects on enzymatic activity. Several mutations occurring in individuals affected with Fabry 
disease map to this region. Some of these mutations have a dominant negative phenotype. 
When a peptide map was published on Gal-A purified from human lung, the authors noted th€ 
absence of the most carboxy-terrninal predicted fragment and hypothesized the proteolytic 
removal of 26 or 28 AA from this region (39,40). Very recently Miyamura et al. published a 

20 study of the effects of carboxy-terrninal truncations on enzymatic activity in transfected COS- 
1 cells (42). Between 2 and 17 AA residues were removed by introducing stop codons into a 
scries of cDNA constructs. Relative enzyme activity, measured using 4-MUG as a substrate, 
first increased and then decreased as AA were removed. 12 and 17 AA constructs had no 
activity, while 1 1 was the same as wild type. A 4 AA construct yielded the highest activity at 

25 approximately 6.2X the full-length sequence. Because the precise AA sequence of the 

carboxy-terminus of the native human enzyme was never determined, there is insufficient 
information to interpret these results. The carboxy-terrninal domain may affect the 
conformation of the active site either directly or indirectly through a proteolytic maturation 
step and/or assembly and subcellular localization of the active form. Furthermore, it is 

30 important to stress that it is the enzyme activity on galactose-terminal glycosprungolipids that 
is relevant to development of a therapeutic enzyme. 

Plant proteins do not require N-linked oligosaccharides for correct sorting into 
vacuoles (35,37,38). Some vacuolar proteins (osmotin, thaumatin, chitrnase-I, glucanase-I 
and a barley lectin), contain sorting information in a CTPP of 7 to 22 AA in length. For 

35 several of these proteins secreted isoforms are synthesized without a CTPP domain. In other 
cases, experimental deletion of the CTPP results in secretion of the recombinant protein to the 
IF (45-48). Sorting of Gal-A to the lysosome is likely to occur by the well-characterized 
mannose-6-phosphate receptor pathway in mammalian cells. We hypothesize that a 
redundant sorting signal may exist in this carboxy-domain that also serves to reduce 
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enzymatic activity in the ER lumen, golgi and trans-golgi network. This signal appears to 
function in plant cells, presumably for vacuolar localization. 

Scale-up Purification and Analysis. In order to evaluate the performance of larger 
scale process equipment, we designed and had built a custom basket centrifuge fixture for a 
5 laboratory low-speed centrifuge that has a capacity of approximately 1 kilogram leaf material. 
The sensitivity of the fluorescent 4-MUG enzyme assay allowed us to begin to evaluate 
enzyme purification from the leaf IF fraction using the construct rGal-A-SEKDEL. (This 
vector only yields approximately l/50th the activity of rGal-A 12-SEKDEL). Leaf tissue was 
transfected, harvested and infiltrated as described in Section B4 (Experimental Results). 
10 i Galactosidase activity was stable in crude IF extracts and was bound to the hydrophobic 

interaction resin octyl sepharose, and eluted in a descending ammonium sulfate gradient The 
lectin resin concanavalin-A sepharose was also effective, indicating the presence of at least 
one high mannose chain, The enzyme did not bind to a commercially available melibiose 
column (Sigma). 

1 5 We have measured the enrichment provided by the affinity resin ct-galactosy lamine 

Sepharose with a C 12 arm (49). Some or all of the three effective chromatography steps were 
combined as necessary with a size exclusion fractionation to yield highly purified enzyme(s). 
Because our current source of enzyme is so enriched (Fig. 2), and several of the published 
purification steps we have shown to be compatible with the plant IF extracts, we anticipate no 

20 problems in enzyme purification. Pure enzyme preparations were shipped to the laboratory of 
Drs. Roscoe Brady and Gary Murray for evaluation of enzyme activity with l^C-galactose- 
■ labeled cerarnide trihexoside. These colleagues were responsible for the development of the 
therapeutically effective glycoform of glucocerebrosidase used to treat Gaucher disease. For 
this purpose, Biosource has a five year Cooperative Research and Development Agreement 

25 with the National Institute of Neurological Disorders and Stroke, NIH, entitled "Human 
Metabolic Enzymes Produced in Plants, " signed 3/15/94. 

We scaled up the purification of up to four candidate therapeutic enzymes as necessary 
in our indoor greenhouses. In our initial experiment (Table 1), 38 and 48 percent of the total 
rGal-A activity was recovered upon the first infiltration and centrifugation treatment 

30 (Construct rGal-A 1 2-SEKDEL) for a yield of >50 mg of enzyme per kilogram of leaf 
material. Experience with the extraction of glucocerebrosidase from the IF indicates that 
additional enzyme is recovered in a second treatment. In these experiments one leaf was 
collected for each sample from each of two plants. There was considerable plant to plant 
variation in the level of enzyme activity (Table 1). We analyzed more carefully the 

35 accumulation of enzyme activity over time post-inoculation to optimize yields. Our facilities 
are more than sufficient to provide the 1 kilogram quantities of biomass necessary to purify 
nanomoles of enzyme for the following sequence and structural work. Sequence analysis and 
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MALDl-lOtf molecular weight determination was performed as a commercial service by 
Commonwealth Biotechnologies, Inc. N-terminal sequence is by the automated Edman 
degradation. C-terminal sequence is by carboxypeptidase digestion followed with amino acid 
analysis. 

5 

Full-Scale Bioprocess Pilot Plant 

Macroextraction. Large-scale maceration of tissue was accomplished by a 65 hp Rietz 
disintegrator mill The macerated tissue was then separated into a "green juice" fraction and a 

10 , fiber fraction in a Rietz screw press. The fiber fraction was dried in a Cardwell drier. The 
"green juice" was then pH adjusted and heated in a dual plate-and- frame heat exchanger 
system with adjustable holding tube. The process of pH adjustment and heating causes the 
precipitation of the Fl protein complex. The protein was then clarified in a 40 hp. Westphalia 
SA-40 disk stack centrifuge capable of clarification of "green juice" at greater than 20 gallons 

15 per minute (GPM). 

Downstream Processing, The concentrate was then pumped to Clean Room 1 that 
houses the primary ultrafiltration (UF) equipment This equipment was fitted with over 1000 
sq. ft. of spiral wound membranes. Typically, the UF was equipped with 100,000 kDa cut-off 

20 membranes. Virus particles are recovered in the retentate. Lower molecular weight proteins 
are recovered in the permeate. The permeate was fractionated by a second UF system fitted 
with appropriate molecular, weight cut-off membranes. The retentate was processed in Clean 
Room 2. Virus was recovered by polyethylene glycol (PEG) precipitation and centrifugation 
in two Sharpies vertical bowl centrifuges. Final purification of soluble proteins and peptides 

25 was accomplished on a series of chromatography systems. 

Additional Facilities. The facility has other major unit processes available for the 
recovery and purification of plant fractions. There are two Alar diatomaceous earth, rotary 
vacuum filters. One of the filters was in an explosion proof area of the pilot plant that can be 
30 used for solvent extraction. The solvent extraction facility also has a biphasic solvent 
extractor and high efficiency distillation column. Extensive tankage was available both 
indoors and outdoors. Pumps, filters and other process equipment are available at the facility, 
allowing a large margin of flexibility while developing new processes. 

35 Full-Scale Pilot Plant Implementation. The Bioprocess Facility has excellent supporting 

infrastructure. The 900 square foot laboratory was equipped with all the basic tools for 
biochemical and protein analyses including: electrophoresis, gel filtration, HPLC, 
spectrophotometry, basic chromatography, chemical analysis, and sample preparation and 
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preservation, ine full scale pilot plant has approximately 15,000 square feet of additional 
floor space for expansion including a high bay tower. External solvent tanks are placed in 
diked enclosures. Two rapid recovery, high pressure (up to 600 psi) steam generators and a 
large twin screw, oilless compressor are on site. A complete shop and maintenance facility 
5 was present along with walk-in cold room and walk-in freezer. Additional equipment 

includes a ceramic microfiltration system, a spray dryer and an array of tanks, pumps, filters, 
heat exchangers, and agitators. 

Process equipment was fabricated and modified by a group of skilled vendors and 
craftsmen capable of fabricating specialized equipment designed by the company, and has 
10 » excellent field experience working in large scale operations. The proposed infiltration, 

centrifugation, vacuum filtration and downstream processes described below are diagrammed 
in a simplified single-line form in Figure 5. This diagram was derived from the existing 
configuration of the plant (Appendix 2). 

15 Infiltration System. Vacuum infiltration can be accomplished in the field or at the 

processing facility. Development experiments determine the necessity to infiltrate the 
material in the field. A vacuum tank was used as the receiver for the plant tissue after harvest 
by the tobacco cutter. The tissue was conveyed into a trailer-mounted tank capable of full 
vacuum and slurried into an infiltration buffer. Hie Owensboro facility has a trailer capable 

20 of carrying approximately 18,000 lb. This will translated into approximately 1,000 gallons 
per trip to the field. The trailer was fitted with a 2,000 gallon tank capable of full volume and 
evacuated by a gasoline driven vacuum pump. In harvests from 1991-1994, it was the goal of 
the team to have harvested biomass at the processing facility in less than 1 hour after cutting. 
If the tissue can be held for approximately one hour without significant loss of enzyme 

25 activity, the biomass can be brought from the field in the conventional wagon and infiltrated 
at the processing facility. Several large, full vacuum tanks can be employed at the facility to 
increase the total throughput of the plant Two large-scale vacuum pump systems in the plant 
that are currently associated with the Alar rotary vacuum filters can be used for the vacuum 
infiltration process step. 

30 

Basket Centrifugation. The full-scale basket-type centrifuge was a discontinuous batch- 
type system. Leaf tissue can be slurried in, dewatered as a batch, then a scraper system 
discharges the solids to a bottom dump. Large leaves and pieces of tissue can be handled in 
this manner. The potential of placing a vacuum system on the discharge side of the 
35 centrifuge was also be investigated. The centrifuge was a hydraulically driven conventional 
basket centrifuge with a bottom discharge and bowl dimension of 48 inches in diameter and a 
depth of 30 inches. Optimum loadings of the centrifuge in full-scale was determined the 
throughput and cycle times of this process step. 



18 



WO 02/08404 PCT/US01/241 1 1 



Vacuum Extraction. Vacuum extraction can be accomplished in large-scale by a web or 
belt-type vacuum filter system common in the food ingredient business. The "in-plant" 
vacuum systems could also be adapted to operate this type of filter. The plant tissue can be 
placed on this type of filter before or after the centrifugarion step. Some damage of the 
biomass was anticipated during the scraper mediated discharge of the basket centrifuge. The 
discharged material was analyzed for the presence of intracellular components and their effect 
on enzyme activity, recovery and separation. These data determined the position of the 
vacuum filtration step in the process flow. 
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Downstream Processing. The initial UF separation was accomplished by an Alfa-Laval 
custom UF system consisting of six modular housings each containing either three 12 inch 
spiral wound membranes (Amicon type) or one standard 38 inch module. This yields a UF 
system with between 740 and 1 140 square feet of membrane area of typical spiral wound 

1 5 configuration. The ability to interchange housings and replace housings by spool pieces 
gives the system great flexibility in large-scale process development. This system was 
housed in Clean Room # 1. This room is 14 x 18 ft, and is under positive pressure, HEPA- 
filtered air. A second UF system was available in Clean Room #1. This smaller system, built 
by Separations Equipment Technology (SETEC), has the capacity for 320 square feet of spiral 

20 wound membrane. This system was employed for the second separation and diafiltration. It 
was designed for automatic diafiltration. Clean Room # 2 is equipped with a Pharmacia 
Streamline fluid bed gel filtration system equipped with UV and refractive index monitoring 
equipment. This unit was available for chromatography steps. 

An antiserum specific for these xylose- and moose-containing complex glycans was 

25 especially useful in developing an EOS A assay to follow enzymatic deglycosylation. Large 
quantities of purified enzyme facilitate definitive determination of glycosylation structure and 
if necessary provide adequate rGal-A to use as substrate for enzymatic deglycosylation 
reactions. Using Gal-A knockout mice in the laboratory of Dr. Brady at NIH was an 
important genetic tool in developing a therapeutically effective glycoform. We use our 

30 trans fee ted plants as a convenient source of recombinant enzyme for glycan analysis. 
Glycoforms are shown in FIGURE 1 1. 

Plants as a Source of Recombinant Pharmaceutical Proteins. A number of genetic tools 
have been developed during the last decade for the expression of foreign genes in plants. In 
addition to various antibody molecules (21-23), the accumulation of serum proteins (24) and 

35 candidate vaccine products (25-28) has been described in the leaves and other tissues of 

whole plants. We increased the attainable expression levels through the use of chimeric RNA 
viruses. For production of specific proteins, transient expression of foreign genes in plants 
using virus-based vectors has several advantages. These chimeric viruses move quickly from 
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an initial infection site and deliver the recombinant gene to essentially all somatic cells of the 
plant. The gene vectors are premier analytical tools because they allow both high level 
expression and brief cycles of protein modification and testing. A permissive host provides 
high levels of expression and may be used for rapid, large-scale recombinant protein 
5 production in whole plants. 

We validated the performance of plant-based expression systems for the production of 
recombinant proteins and peptides of pharmaceutical significance. In two weeks post- 
inoculation, the ribosome inactivating enzyme a -trichosanthin was over-produced in plants to 
2% of the total soluble protein and had the same specific activity as the enzyme from the 
1 0 » native source (29). Because these products can be obtained from a non-sterile, low input, 
renewable and easily scalable source, the costs of synthesis in plants are negligible. We 
confirmed the performance and containment of the vectors in four field trials (1991, 1994, 
1995, 1996). 

The vectors of the invention are based on chimeras between the 6.4 kb single-stranded 
1 5 RNA genome of tobacco mosaic virus (TMV) and other members of the tobamovirus group. 
Most of the TMV genome encodes overlapping reading frames required for replication and 
transcription (Fig. 1 A). These are located at the 5 ! end of the virus and translated from 
genomic RNA yielding proteins of 126 and 183 kDa. Expression of the internal genes was 
controlled by different promoters on the minus-sense RNA that direct synthesis of 3'- 
20 coterminal subgenomic mRNAs produced during replication. The 30 kDa protein, which was 
required for the virus to move from cell to cell, was produced early and in relatively low 
amounts, whereas the 17.5 kDa coat protein was produced late and usually as the most 
abundant protein in infected cells. Largely because of the strength of the coat protein 
subgenomic promoter, during peak protein synthesis the coat protein can be produced at up to 
25 70% of the total rate of cellular protein synthesis without appreciably reducing host protein 
synthesis (30). 

The entire cDNA of the TMV genome was cloned behind a bacterial phage promoter in 
an E. coli plasmid. Precise replicas of the virion RNA can be produced in vitro with RNA 
polymerase and dinucleotide cap, m^GpppG. This not only allows manipulation of the viral 

30 genome for reverse genetics, but it also allows manipulation of the virus into a gene transfer 
vector. Subgenomic promoters from divergent viral strains can be added to the genome to 
direct the expression of foreign genes. Enormous quantities of mRNA are synthesized and 
delivered directly from the cytoplasm to the ribosome. TMV-based transient vectors offer 
significant advantages over integration of genes into plant chromosomes. By altering the 

35 molecular exclusion limits of the cellular junctions between adjacent plant cells, the vector 
invades virtually every cell of the plant during a period of 2 weeks post-inoculation. For 
many gene products, the recombinant protein accumulates to several percent of the total 
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protein during this brief period of time. In contrast, it was very time consuming and labor 
intensive to generate, select, and breed transgenic plants for recombinant protein production. 
Many of these selections were culled because of poor expression due to position effects or 
gene silencing phenomena. In many more lines, the levels of product accumulation was too 
5 low for development of a viable commercial process. 

EXAMPLE 1 

We have established that a recombinant human lysosomal enzyme (rGCB) synthesized in 

10 » transgenic tobacco has comparable activity to the same enzyme isolated from other native and 
recombinant sources. We also investigated the feasibility and economic advantages of 
purifying large quantities of active rGCB from plants. We designed and fabricated laboratory 
equipment that enabled us to optimize the key initial steps of a purification process in the 
laboratory using kilogram quantities of biomass from our greenhouses. We standardized a 

1 5 series of assays for secretory and intracellular marker enzymes in addition to rGCB assays 
that allowed us to monitor both lab and field expression as well as the purification process. 
Leaf tissue was infiltrated with a suitable extraction buffer while submerged in a large 
vacuum chamber, allowing the solution to reach the leaf intercellular fluid containing rGCB. 
The IF fraction was recovered by centrifugation in a custom collection chamber and "basket" 

20 centrifuge rotor compatible with a conventional Beckman J2-2 1 spindle. rGCB was trapped 
from the dilute IF solution by expanded bed adsorption chromatography using a hydrophobic 
resin and eluted with polyethylene glycol. A second ion exchange chromatography step was 
implemented for an overall yield of 1.7 mg/kg at 41% purity to this stage. These procedures 
were then scaled-up to 1 00 kg during several pilot-process experiments in a field trial using 

25 analogous industrial bioprocess equipment. These results are summarized in the table below. 
Three lots of rGCB were further purified by RP-HPLC and used for carbohydrate profiling 
and composition analysis. In NMR experiments we confirmed that the GCB from the plant IF 
contains an N-linked glycan previously reported to occur in glycoproteins isolated from plant 
seeds and tissue cultures. This type of chain contains the plant-specific carbohydrate linkages 

30 of a 1-2 xylose and p 1-3 fucose on the trimannosyl core. 

Making Transgenic Tobacco Plants to produce glucocerebrosidase 

Several founder plant lines for genetically stable expression of rGCB were generated and 
35 characterized. Under greenhouse conditions individual plants accumulate rGCB to at least 
1.3% of the total protein in the leaf intercellular fluid as estimated from enzymatic assays. 
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This represents a 50-fold enrichment relative to the crude lysosomal fraction of placental 
extracts used as the starting material for the product Ceredase™. 

Transgenic Tobacco Leaves Express Moderate Levels of rGCB. We combined a dual 
promoter from Cauliflower Mosaic Virus (35S), a translational enhancer from Tobacco Etch 
5 Virus and a polyadenylation region from the nopaline synthetase gene of Agrobacterium 
tumefaciens. with the native human GCB cDNA to create plasmid pBSG638 (33; Fig. 1). 
These expression elements are widely used to provide the highest possible constitutive 
expression of nuclear encoded genes. Depending on the nature of individual proteins, these 
vectors can be used to accumulate moderate levels of recombinant proteins in most tissues of 
10 »many plant species. 



TRANSGENIC VECTOR FOR rGCB EXPRESSION 
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Using a standard Agrobacterium-mediated transformation method, we regenerated 93 
independent kanamycin-resistant transformants from leaf discs of four different tobacco 
cultivars (the TO generation). In Western blots of total protein extracts, cross-reacting antigen 
was detected in 46 of these TO individuals with antibody raised against human 
glucocerebrosidase. Specificity of the plant-expressed recombinant enzyme was confirmed 

by hydrolysis of 14 C-radiolabeled glucosylceramide. 



Leaf Disc Transformation with Agrobacterium tumefaciens (59, 60, 61) 
Method: 

25 

1) Transform the T-DNA plasmid into A.t. LBA4404 selecting for the bacterial Ab R 
gene (generally Km at 100 ug/ml). 

2) Pick a single colony into YEB medium plus antibiotic and grow at 2S°C overnight 
(to saturation; often takes a little longer than overnight). 

30 

3) Take aseptic or surface-sterilized Nicotiana tabacum (MD609, Xanthi, SRI, 
Samsun) leaves, remove midrib and cut into leaf "chunks" -1cm 2 . 
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4) With sterile forceps, dip (submerge) the leaf disc iifto the Agrobacterium 
suspension. 

=> Placing the bacterial culture into a small petri dish is convenient. 

5) Remove the leaf disc from the Agrobacterium and place the disc on regeneration 
5 medium. Place the discs so that the underside of the leaf is up. (They seem to do better this 

way, perhaps because of better gas transfer.) 

I use needle-nose forceps to handle the discs, thus introducing small puncture 
wounds into which Agrobacterium can infect; small wounds are good, major damage 
(e.g., crushing) to the disc is bad. 
10 6) Seal plate containing discs with Parafilm® and incubate at 25-28°C, preferably in 

light with a yellow Filter to inhibit UV degradation of the medium. 

7) After 2 days co-incubation, transfer the leaf discs to selective plates (regeneration 
medium plus 500ug/ml Cefotaxime). 

8) After 2 more days, transfer discs to regeneration medium plus 500ug/ml 
15 cefotaxime and 100 ug/ml kanamycin 

9) When normal-looking shoots appear, excise them, taking care not to excise any 
callus, and place in rooting medium. 

=> Callus on the end of the stem generally prevents rooting, and could lead to a 
chimeric set of shoots. 

20 => The lower % agar makes it easier to wash the agar off the roots when trartsferring 

to soil. 

=>If there is time, it is a good practice (when the plants are rooted and growing) to cut 
the shoots off and re- root them. Escapes will generally not root on Km medium. 

1 0) When roots first appear, remove plantlets, wash agar from the roots and plant in 
25 soil medium in small pots. Cover pots with a plastic bag for the first 5 days or so to retain 

humidity and reduce transplantation shock. 

I Liter of Regeneration Medium contains: 
MS Salts 
30 30 g sucrose 

I ml of 0.5 rng/ml nicotinic acid 

1 ml of 0.5 mg/mi pyridoxine HC1 

2 ml of 0.5 mg/ml thiamine HCl 
2 ml of 50 mg/ml inositol 

35 1.5 ml of 0.1 mg/ml IAA 

5.0 ml of 1.0 mg/ml 2-EP-2-iminopurine 
8 gofagar,pH 6.0 
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1 Liter of Rooting Medium contains: 
1/2 x MS Salts 

10 g sucrose 

2 ml of 0.1 mg/mllAA 
8 g agar, pH 5.7 



10 



A deposit at ATCC under the Budapest treaty was made on July 25, 2000 of seed 
from Nicotiana benthamiana MD609, Accession No. . 

According to these expression results the rGCB positive transformants were ranked into 
moderate (A), low (B) and negligible (C) activity groups (Table 2). 



TABLE 2. EXPRESSION OF rGCB IN THE TO GENERATION. 



15 



Number of Specific Activity 



Group Individuals 


Units/me 


A 


13 


130-390 


B 


20 


70430 


C 


13 


24-68 


Control: 


8 


0-10 



Specific Activity is based on hydrolysis of [14C]- glucosylceramide (Units = nmol/hr). 



Plant rGCB is Similar to Macrophage-Targeted Glucocerebrosidase. We found reaction 
20 conditions to preferentially inhibit rGCB enzyme activity in the presence of plant 

glucosidases using the suicide substrate conduritol B -epoxide (CBE). Total glucosidase 
activity, and rGCB activity were measured by hydrolysis of the fluorescent substrate 4- 
methylumbeUiferylglucopyranoside (4-MUG) with and without CBE. Total protein was 
determined by the method of Bradford. Detergents are necessary to solubilize and stabilize 
25 activity of this membrane-associated enzyme. Using a small scale (-100 mg fresh weight) 
extraction procedure, several detergents were compared for yield of enzyme activity and 
purity (including; IGEPAL CA-630, Tween-20, Tween-80, Triton X-100, Triton X-l 14, 
CHAPS, taurocholic acid, cholic acid, deoxycholate and taurodeoxychoiate). Buffer without 
detergent, deoxycholate, taurocholate and cholate below their critical micelle concentrations 
30 (CMC) (0.1%) yielded low units of rGCB. All of the other detergents gave comparable 
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specific activity and yields of total activity with Tween-80 yielding slightly higher activity. 
The dialyzable bile salt, sodium taurocholate and the lower CMC detergent Tween-80 were 
compared at a range of concentrations (0. 1-1% and 0.001-1%, respectively). Tween-80 at 
0.15% and taurocholate at 0.5% gave the best yield and purity. 
5 A number of chromatography steps were evaluated for purification of rGCB from total 

homogenates (Table 1). As is the case for the native placental enzyme, hydrophobic resins 
provide the most significant purification gains. Gel filtration, Con A Sepharose and affinity 
chromatography also worked very well, but some of these approaches may be impractical on 
a large scale. Both anion and cation chromatography may prove useful, but the ideal buffer 
10 . conditions for stabilization of enzyme activity remain to be determined. 



TABLE 1. SUMMARY OF CHROMOTOGRAPHY RESULTS 



Column Matrix 


- . - ,.Tjpe. 


Results 


Octyl Sepharose 4 FF 


Hydrophobic 




Phenyl Sepharose HP 


Hydrophobic 




Phenyl Sepharose 6 FF 


Hydrophobic 


+ 


Butyl Sepharose 4 FF 


Hydrophobic 




Alkyl Superose 


Hydrophobic 




SP Sepharose FF 


Strong Cation 


+/- 


Q Sepharose FF 


Strong Anion 


+/- 


Con A Sepharose 


Lectin Affinity 


+/. 


NHS-Activated Sepharose HP 


Antibody Affinity 


+ 


Sephacryl S-100 HR 


Gel Filtration 


+ 



15 

(+) Effective increase in Specific Activity; (+/-) Needs enzyme activity stabilized; 
(+/--) Variable results; (-) Poor binding. 

The post-translational processing of native glucocerebrosidase (GCR) in human cells 
20 is complex. Two primary translation products are derived from two in-phase start codons. 
These precursors, a 2: 1 mixture of 60 kDa and 57 kDa proteins, are proteolytically processed 
to 55 kDa as they pass into the lumen of. the ER. High mannose and complex glycans are 
subsequently added in the ER and Golgi compartments to yield 62 and 66 kDa glycoforms. 
Finally, exoglycosidases generate a mature 59 kDa lysosomal enzyme. Recall that 
25 glycosylation is required for both enzymatic activity and lysosomal targeting of transfused 

enzyme. Sialic acid, galactose, and N-acetylglucosamine residues are enzymatically removed 
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in vitro by the sequential action of glycosidases to prepare giu«ocerebrosidase for therapy. 
The core pathway for biosynthesis and processing of N-linked complex glycans in plants 
appears identical to that found in animals. There are three known differences which occur 
later in the pathway. Sialic acid is not reported in complex glycans from plants, and the a 1-3 
5 fucose and P 1-2 xylose linkages are unique (34). As analyzed by SDS/PAGE, rGCB has an 
apparent molecular weight of 59 kDa, and comigrates with the mannose-terminal therapeutic 
glycoform. We have not yet detected a significant shift in mobility upon treatment with 
glycosidases (PNGase F, Endo H f a 1-3 fucosidase) in our preliminary gtycosylation analysis 
(Fig. 2). However, the enzyme has an apparent molecular weight increase of 4 kDa over the 

1 0 1 proteolytically processed and unglycosylated form (55 kDa) and must be glycosylated for 
activity. Additional digestions are in progress with a more extensive set of endo- and 
exoglycosidases and known plant glycoprotein controls. N-Glycosidase A is reported to 
hydrolyze all types of N-glycan chains from glycopeptides and glycoproteins. 

The signal peptide of rGCB is processed at the correct site. A very small quantity of 

1 5 protein was prepared for sequence analysis by purification through Phenyl-Sepharose, ConA- 
Sepharose and RP-HPLC to produce a single band on SDS-PAGE comigrating with authentic 
glucocerebrosidase. Hie sequence obtained was consistent with the known sequence of 
processed GCR (Fig. 3), In this particular analysis, the first two positions were not resolved 
because some degradation occurred during sample preparation. Correct proteolytic cleavage 

20 of a signal peptide is also confirmed for a mouse antibody light chain molecule expressed in 
tobacco leaves (35). 



STRUCTURE OF THE N-TERMINUS OF rGCB 



N-tenrunal Amino Acid Sequence 




XXPXIPKSFGY 


rGCB 


from tobacco 




ARPCI PKSFGY 


GCR 


human 





Plant rGCB Accumulates in the Leaf Intercellular Fluid. We localized rGCB to the 
intercellular fluid of the leaf using the following simple experimental design. Leaves 
were removed from the plant at the petiole and slit down the midrib into two equal 
35 halves. To obtain a total cellular homogenate, one group of half-leaves was ground in 
the presence of 4 volumes of detergent extraction buffer (100 mM potassium 
phosphate pH 6, 5 mM EDTA, 10 mM P-mercaptoethanol and 0.5% w/v sodium 
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taurocholate) with a mortar and pestle. To recover the IF, the same enzyme extraction 
buffer was infiltrated into the opposing group of half-leaves by submerging the tissue 
and applying moderate vacuum pressure. After draining off excess buffer, the 
undisrupted half-leaves were rolled gently in Parafilm, placed in disposable tubes and 
5 the IF collected by low-speed centrifugation. The IF fraction is quite clear and non 
pigmented and can be applied directly to Phenyl Sepharose hydrophobic resin. 

The results of a typical experiment are shown in Table 2. The increase in specific 
activity corresponds to a similar increase in the amount of cross-reacting material observed in 
a Western blot and is therefore not an artifact of the enzyme assay in the different fractions. 

10 ' Furthermore, rGCB activity was very stable in crude extracts using this particular detergent 
buffer. The increase in specific activity can therefore be attributed to an enrichment of rGCB 
in the IF relative to the whole cell homogenate. The actual concentration of rGCB. in the IF is 
likely to be much higher, because PAGE analysis of the EF fraction shows some 
contamination with known cytoplasmic markers. The highest specific activity we have 

1 5 measured in an IF sample is 20,000 U/mg. If we assume rGCB has the same specific activity 
as the human enzyme (1.5 X 10^ U/mg), this corresponds to 1.3% of the IF protein obtained 
by this method. 



TABLE 2. LOCALIZATION OF rGCB TO THE INTERCELLULAR FLUID 



Fresh Total Protein Total Protein rGCB Total rGCB Specific %Recover 
WeighVolum- Cone. Proteii Yield Cone. rGCl Yield Activit rGCB X-Fold 
Sample (gr) (ml) (mg/ml' (mg) (mz/zr)(U/mX. (U) (U/gr (U/mg) in IF Purificatior 

InterceUularFlu: 2.48 1.9 0.24 0.45 0.18 720 1368 552 3007 22 18 
Homogenate 2.08 8.1 3.89 31.48 15.13 653 5289 2543 168 

201 



Specific activity is based on the hydrolysis of 4-MUG inhibited by 0.5 mM CBE. Units (U) s 



nmol/hr. Because the amount (in nanograms) of cross reacting material observed in a 
25 quantitative Western blot corresponds within experimental error to the amount (in 
nanograms) of enzyme calculated on the basis of activity, we believe the plant rGCB 
was synthesized with high specific activity. This was a very important and favorable 
indirect estimate of specific activity. The enzyme was purified to homogeneity to 
measure more precisely the actual specific activity. 
30 High Levels of rGCB Expression in Leaf Tissue Induce Gene Silencing. The TO 

individuals described in Table 2 are. by definition hemizygous. They contain various loci 
generated from independent insertion events, having no corresponding insert on the 
homologous chromosome. The thirteen TO individuals from Group A were self-pollinated 
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and assayed for levels of enzyme expression in the Tl generation in order to analyze the 
effects of gene dosage (homozygotes versus hemizygotes) and to identify candidate Tl 
families for future seed increase. Kanamycin-resistant transgenic plants were randomly 
selected from segregating families and analyzed for rGCB expression. The number of 
probable loci was estimated by chi-square analysis of the linked kanamycin-resistant 
phenotype at >95% confidence level. There are several Tl families with a heritable mean 
rGCB activity in the range of 200-300 U/mg (nmol 4-MUG hydrolyzed per hour) in the total 
homogenates that we have selected for further production of the enzyme (Table 3). 



10 . 



TABLE 3. EXPRESSION OF rGCB IN THE Tl GENERATION 









Mean 






Tobacco 


Tl 


Number Specific Activity 


Standard 


Number of 


Cultivar 


Family of Loci 


Units/me 


Error 


Individuals 


Samsun 


963 


2 


294 


25 


23 


Samsun 


881 




242 


22 


16 


MD609 


920 




205 


15 


38 


Xanthi 


902 




202 


17 


5 


Samsun 


883 




201 


18 


13 


Xanthi 


832 




195 


18 


9 


SRI 


826 




184 


16 


40 


SRI 


834 




145 


9 


32 


Xanthi 


851 




140 


15 


5 


Samsun 


837 




129 


16 


8 


Xanthi 


831 




114 


21 


10 


Xanthi 


833 




107 


12 


5 


Xanthi 


807 


lto2 


87 


12 


9 




Controls 


40 


8 


20 



However, of 235 Tl plants analyzed, the single individual having the highest activity and the 
15 single observation of completely null expression were siblings of the Tl family 826. 
Moreover, extracts from 826 were also quantitatively the second highest sample of the 
original 46 analyzed for enzyme activity in the TO generation. By Western blot, we analyzed 
protein extracts from several Tl siblings of this family, including the highest{6!2 U/mg) and 
the- lowest (0 U/mg) and found a clear linear correlation between the amount of cross-reacting 
20 protein at 59 kDa and the activity loaded in each lane. In addition to the 59 kDa band, there 
were also variable amounts of cross-reacting protein at 52 kDa. In the null individual there 
was only the 52 lcDa protein. We never observed this molecular weight species in the TO 
extracts or in any other Tl family. There was no evidence of proteolytic activity in this 
sample as judged by mixing the null sample with high activity extracts and analyzing by 
25 enzyme assays and Western blots after incubation at 37*C. If the apparently truncated rGCB 
was derived from proteolytic cleavage, the protease activity must be both physiologically 



28 



WO 02/08404 PCT/US01/241 1 1 

induced and inactive under these isolation conditions. When the null individual was self- 
pollinated and the T2 generation analyzed, enzyme expression reappeared as in the TI and 
TO. Our working hypothesis was that the tobacco plant is able to limit the expression of the 
foreign enzyme as constitutively expressed from this cDNA construct, and that the threshold 
5 for the stochastic induction of this response during development occurs at an expression level 
corresponding to approximately 600 U/mg specific activity in the crude homogenate. Of the 
lines we created, 826 were able to produce enough mRNA to exceed this threshold in the 
homozygous state. 

The silencing of genes in plants is a recently described phenomenon. Work has been 

1 0 ' done detailing a cellular surveillance mechanism that has apparently evolved to specifically 
degrade excess RNA (36). In one case, specific RNA cleavages near the 3'-end of the 
transcript initiate the removal of the transcript. Our description of the silencing of rGCB 
above 600 U/mg is the first association of silencing with a truncated protein, and may well be 
caused by a specific mRNA (and not protein) cleavage event. Gene silencing may determine 

1 5 an upper limit of expression attainable using constitutive transgene expression. 

We subcloned the cDNA for glucocercbrosidase into a TMV-transient vector and 
cDNA combinations. Transcripts were synthesized in vitro and inoculated directly onto lower 
leaves of whole plants. In each case, there was an additional lag time of about 2 weeks post- 
inoculation before appearance of virus in the upper leaves of the plant and in each case the 

20 viral population no longer carried a significant portion of the gene. We detected no 
significant enzyme activity in either inoculated or systemically infected leaves. Very 
recently, we detected the gene in root tissue and in transfected protoplasts. There appears to 
be an incompatibility with leaf expression under conditions of viral amplification of the rGCB 
mRNA. This incompatibility selects for loss of the sequence from the viral population. 

25 To further investigate the nature of the leaf incompatibility with rGCB expression, we 

built the construct pBSG641. This plasmid contains the rGCB gene substituted into the coat 
protein region. The remaining portion of the entire genome was then placed under control of 
the 35S promoter. The promoter was designed to initiate RNA synthesis such that the correct 
5'-end of TMV would be synthesized. A custom-designed, self-cleaving ribozyme sequence 

30 positioned at the end of the genome yields a native 3'-end upon cleavage. The vector was 

designed for synthesis of infectious transcripts in vivo from a chromosomally integrated locus 
and production of rGCB through viral amplification of subgenomic mRNA in the cytoplasm. 
The vector alone without the gene for rGCB produces a systemic but capsid- free, "naked- 
RNA" infection (38). This RNA co-suppression is the subject of issued U.S. Patent No. 

35 5,922,602 issued July 13, 1999, the disclosure of which is incorporated herein by reference. 
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VIRAL VECTOR FOR rGCB EXPRESSION 



PBS0641 

8651 bp 



CaMV 35S 



promoter ' gaggGTAX - 




126KD/184KD 




Pac( 



Pme I 



BslXI 



10 



15 



20 



25 



We introduced this construct into Agrobacterium and transformed tobacco plants as 
described above. In this case many of the plant leaves displayed necrotic lesions as 
transfection events randomly occurred during growth and development and expansion of 
leaves. These lesions never formed on control transformed plant lines containing vector only 
sequences capable of replication. These lesions were identical in appearance to the types of 
lesions induced by plant pathogens during a type of disease resistance reaction, termed the 
hypersensitive response (HR). Therefore, under conditions where we expect to accumulate 
large quantities of active enzyme, an HR is signaled by some component of the vector 
infection specific to rGCB. There are very few of these so-called HR "elicitors" characterized 
in the literature. Possibly the rGCB enzyme itself or a secondary metabolite resulting from 
enzymatic activity, or even rGCB RNA, may induce the HR. In any case, we hypothesize that 
the HR selects for loss of the gene from the viral RNA population. It is important to 
remember that this is not a simple genetic instability phenomenon. Under conditions where 
an HR is not induced, we have synthesized many proteins using TMV-based RNA viral 
vectors to levels of several percent of the total soluble cell protein without loss of the inserted 
gene even after virion passage. 

Expression of rGCB in Transgenic Tobacco is Robust. In several experiments, we 
inoculated wild type TMV onto rGCB containing transgenic tobacco and found a -1.5-2 fold 
increase in the specific activity of total homogenates. It appears that the viral infection causes 
an increase in promoter activity, and/or the secretion and accumulation of active enzyme. 
This was an important result, because it demonstrates that the expression was compatible with 
a TMV infection, a physiologically severe stress condition. Furthermore, in separate work, 
we have used chimeric TMV particles as recombinant carriers for the production of small 
peptides (31). 

Conclusions. 

We used a wide range of gene expression tools to investigate the accumulation of 
rGCB in mature tobacco plants. Our results suggest attractive yield, quality and cost 
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objectives can be met with further development. We observeoVtwo independent phenomena 
currently limiting the accumulation of enzyme activity in whole plants; gene silencing in one 
transgenic line, and a plant leaf hypersensitive response to transient vector mediated 
synthesis. 

These current limitations in gene expression only serve to underscore the advantages 
and utility of agriculture for recombinant protein production. We have generated several 
transgenic tobacco lines as a reliable source of biomass for the production of high specific 
activity enzyme. Because the biomass is accumulated under no sterile growth conditions and 
production is inexpensively scaled to the quantities desired, it becomes feasible to exploit a 
. dilute but enriched source such as the intercellular fluid fraction for industrial process 
development. This contrast is most clearly summarized in Table 4. 

TABLE 4. INITAL STEPS IN THE PURIFICATION OF GLUCOCEREBROSID ASE 



PLACENTA HOMOGENAT TOBACCO HOMOGENAT TOBACCO LEAF IF 

Specific Specific Specific 

Purification Activity Activity Recovery Activity Activity Recovery Activity Activity Reco\ 

Procedure Units/kg Units/mg % Units/kg Units/mg % Units/kg Units/mg % 

Detergent 

Extraction 1,510,000 375 100 1,870,000 230 100 877,000 9,967 10C 
Concentration/ 

Delipidation 707,000 9,330 47 1,540,000 14,400 82 U U U 
Hydrophobic 

Chromatography 554,000 147,000 36 1,242,000 82,000 74 535,000 34,547 61 



The placental homogenate procedure is adapted from Furbish et al., (10) starting with a 
14,000 x g sedimented material. In a typical preparation 1 5-30 kg of fresh placentas were 
processed. The tobacco homogenate is based on the average of 2 typical 1 kg extractions of 
the leaf biomass. The IF data is from an average of 5 small scale extraction experiments (2- 
200 grams fresh weight), and a single chromatography run of an IF concentrate. For 
comparative purposes all yields are normalized to 1 kg. 
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TABLE 5 



10 



15 



GCB IF PILOT PROCESS 

Greenhouse Scale (1 kg) Field Scale (100 kg) 

Specific Total Specific Total 

Purification Activity Activity Recovery Purification Activity Activity 

Recovery Purification 
Step Units/kg Units/mg % Fold Units/kg Units/mg % 

Fold 



IF 4,153,533 20,388 100 1.0 

1.0 

Phenyl SL 3,738,180 147,813 91.4 7.25 
5.0 

SP Big Beads 2,740,086 650,377 67.0 31.9 
38.2 



434,927 2,745 100 



194,722 12,960 44.4 



145,060 99,220 33.1 



20 For comparative purposes all yields are normalized to 1 kg. 

The greenhouse/laboratory scale process is based on an average of 2 
infiltration/chromatography runs starting with 1 kilogram of fresh weight leaf tissue. 

The GenBank accession No. for glucocerebrosidase is Ml 1080. The field scale process is 
an average of 7 large scale infiltrations consisting of 100 kilograms of fresh weight tissue. 
25 Enzyme activity is based on the cleavage of 4-methylumbelliferylglucoside (1 Unit = 1 
nmol/hr). One factor contributing to a lower apparent yield in the field on a fresh weight 
basis is that rGCB is concentrated in the leaf lamina and in the lab scale procedure the midrib 
was removed. 

30 BIOSOURCE TECHNOLOGIES STANDARD OPERATING PROCEDURE 

(No. GCB35) 

Preparation of Solutions for GCB Assay with 4-Methylumbelliferyl fl-D-glucopyranoside 



35 I . GCB Assay Buffer 

0.1 MPotassium Phosphate, 0. 15% Triton X- 100, 0.125% sodium taurocholate (Sigma T- 
4009), 0.1% bovine serum albumin, 0.02% sodium azide, pH 5.9 
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Dissolve 13.6 grams of potassium phosphate monobasic (KH2PO4) in 950 ml of distilled 
water. Add 1 .25 g of sodium taurocholate and 1 .5 g of Triton X-100. Triton X-100 is a very 
viscous liquid and should be weighed rather than pipetted in order to achieve a reproducible 
buffer. Add 2 ml of 10% sodium azide and I gram of bovine serum albumin (BSA). Stir 
5 until all material has dissolved. Adjust the pH to 5.9 by the addition of a small amount of 1 N 
NaOH, then bring up to 1000 ml with water. 

Filter sterilize and store at 4°C. This buffer is stable for many months. 

2. Stopping Buffer 
10 ' 0.1 M Glycine in 0. 1 M NaOH 

Dissolve 4 grams of NaOH and 7.51 grams of glycine in 1 liter of distilled water. Filter 
sterilize and store at 4°C. (Stable for years at 4°C). 

15 3. Substrate (Sigma M-3633) FW 338.3 

15 mM 4-methylumbelliferyl (3-D-glucopyranoside (4-MUG) in assay buffer 

Weigh out 1 gram of 4-MUG into a 500 ml Erlenmeyer flask. Add exactly 197 ml of Assay 
Buffer (Substrate dilution Buffer) and heat in a hot water bath to dissolve. Caution: Heating 
20 too aggressively results in unacceptably high background fluorescence. 

After cooling, dispense into 5-7 ml aliquots in 15 ml polypropylene tubes, let tubes cool to 

room temperature and freeze at -20°C for later use. 

4. 125 mM Conduritol P-epoxidc (CBE) (Toronto Research C-66600) 
25 MW- 162.18 

Dissolve 100 mg of CBE in 4.92 ml of 0.1 M KPO4 Buffer, pH 6.0. Dispense into 200 -500 
ul aliquots and store at -20°C. 

5. 0.1 M KPO4 Buffer, pH 6.0 

30 1 .75 ml of 0.5 M KH2PO4 (Solution A) 87.5 mM 
0.246 ml of 0.5 M K2HPO4 (Solution B) 12.3 mM 
Add distilled water to 10 ml. 

Reagents: 

35 Potassium Phosphate Monobasic (KH2 p 04) Fisher Scientific P285 
Potassium Phosphate Dibasic (K2HPO4) Fisher Scientific P288 
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Triton X-100 Sigma X-100 
Sodium Taurocholic Acid Sigma T-4009 
Bovine Serum Albumin, Fraction V Sigma A-2153 
Sodium Azide Sigma S-2002 
5 Glycine Sigma G-4392 

Sodium Hydroxide Pellets Fisher Scientific S3 1 8 

Conduritol p-epoxide (CBE) Toronto Research C-66600 MW = 162.18 
4-MethylumbelIiferyl B-D-glucopyranoside (0-D-glucoside) Sigma M-3633 

10 • 

BIOSOURCE TECHNOLOGIES STANDARD OPERATING PROCEDURE 

(No. GCB36) 



GCB Assay with 4-Methylumbelliferyl fl-D-glucopyranoside (MUG) 



15 



1.0 PURPOSE 

To measure the amount of glucocerebrosidase activity from transgenic tobacco plants 
following infiltration and/or homogenization of the tissue. Measurement of fiuorometric 
activity requires an accurate determination and relationship between fluorescence of the 
20 released methylumbelliferone and its concentration under the assay conditions. 

SCOPE • 

This is an inhibition assay. CBE inhibits human GCB. The fluorescent value used to 
calculate activity is based on the difFerence in values with and without inhibitor present The 

25 fluorescent value with CBE (plant glucosidase) is subtracted from the fluorescent value 
without inhibitor which is both plant and human GCB activity. The difference being the 
value for human GCB expressed in the transgenic plants. The assay is carried out using 5 ul 
of sample with 45 u,l assay buffer +/- CBE at 37°C. This means that 2 tubes are needed per 
sample. This procedure is applicable to the Glucocerebrosidase assay procedure requiring a 

30 methylumbelliferone standard curve. 

EQUIPMENT 

Fluorometer (St. John Associates Fluoro-Tec 2001 A with KV-418 filter and 365 nm 
interference filter) 
35 10x75 mm cuvettes (St. John Associates) 
Water Bath 

Test Tubes 13x100 mm glass (VWR or equivalent) 
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4-Methylumbelliferone (Sigma M-1381) » 
Pipettes and pipette tips, 5 ul - 1 ml (Rainin or equivalent) 
1 .5 ml micro fiage tubes 

5 PRECAUTIONS 

The fluorometer should be warmed up for at least 20 minutes prior to 
reading samples. The power switch should be left on at all times. If the power 
switch was turned off it may take longer (up to 1 hour) for the instrument to 

stabilize. 

10' Be certain to put away all reagents under proper storage conditions after 

reading assays. Any left over fluorescent substrates and CBE stock should be 
returned to -20°C. The fluorescent substrate and CBE stock can be frozen and 
thawed numerous times without any breakdown of the reagents. Do not save any substrate or 
assay buffer to which you added CBE. These should be discarded You should only make up 

1 5 enough reagent with the inhibitor (CBE) that you currently need. 

PROCEDURE 

1. Turn on the water bath and check to be certain the temperature is set to 37°C. 

2. Turn on fluorometer to warm up by flipping up the PMT switch. It should be 

20 turned on at least 15-20 minutes before taking readings. If the power switch was turned off 
it may take longer (up to I hour) for the instrument to stabilize. 

3. Remove assay buffer from refrigerator and CBE (conduritol p epoxide) from - 

20°C freezer. Thaw CBE on ice. 

4. Defrost the appropriate amount of methylumbelliferyl B-D-glucopyranoside 

25 (MUG) substrate. You need 400 ul of MUG for each sample (+/-CBE). Place tubes in 37°C 
H2O bath for approximately 10 minutes to get MUG into solution. Note: There may be a 

small amount of insoluble material (MUG) in each tube even after the 10 minutes at 37°C. 
Vortex before use. Keep at room temperature until ready to use. 

5. Remove enough GCB Assay Buffer from the stock bottle that will be needed for 
30 your samples and transfer to a 15 ml tube. Add appropriate amount of 125 mM stock solution 

of CBE (conduritol (3 epoxide) to assay buffer so the final concentration is 0.55 mM CBE. 
CBE stock is stored at -20°C. Thaw some CBE on ice now if you have not already done so. 
The assay buffer +CBE may be kept at room temperature if assays will be completed within 1 
hour otherwise store solutions with CBE on ice. 
35 Note; you want the final cone, of CBE in the assay to be 0.5 mM, you are 

adding 45 ul of buffer to 5 ul of sample so the starting cone, of CBE should be 0.55 mM. 
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Example: Add 8.8 ul of 1 25 mM CBE stock solution to 1 .99 1'ml of assay buffer to equal 
0.55 mM CBE in assay buffer. This is enough buffer to carry out at least 40 assays, 

6. Label two 13 x 100 mM glass tubes for each sample to be assayed with a number 
and the same number and a "+" sign . ( 1, 1+, 2, 2+, etc). Tubes with "+"contain CBE, tubes 

5 with just a number will not have CBE added to the assay buffer or MUG. 

Carry out the following steps on ice. 

7. Place numbered tubes in white racks and place in Styrofoam ice chest filled with 
10 • enough ice and H2O (Ice Bath) to cover the volume of fluid in these tubes. 

8. Add 45 ul assay buffer to all the tubes with numbers only. 

Add 45 ul of assay buffer + CBE to all the "+" tubes. 

Place pipette tip in bottom of tube to deliver assay buffer to bottom of tube. 

9. Remove metal tip ejector from P- 10 pipetman (so pipetman will reach bottom of 
i 5 tube) and pipette 5 u.1 of sample into each set of appropriate tubes of assay buffer (one 

and one tube with a number only (-CBE) for each sample or twice this number if done in 

duplicate). Place pipette tip in bottom of tube to deliver sample directly into assay buffer. 

This is very important since you will be pipetting small volumes into these tubes. 

Be sure to include a Buffer sample to blank the fluorometer. This will contain 5 ul of the 
20 same buffer that the samples are in. 

Note: See Dilution of Enzymes if your sample is too concentrated to read in the 

fluorometer. Basically, dilute your sample 1:5, 1:10, etc. in assay buffer, mix well, pulse in 

microfuge and add 5 ul of diluted sample to assay buffer above. You should only need 5-10 

ul of your sample for the dilutions. 
25 10. Incubate tubes at 37°C for 10 minutes then place tubes immediately on ice. 

11. Aliquot the volume of MUG needed for tube assays (200 ul per assay + 
CBE) into a 15 ml polypropylene tube and add CBE stock to a final concentration of 0.5 mM. 

- 4 ul of 125 mM CBE per I ml MUG in assay buffer = 0.5 mM CBE 

12. Add MUG +/- CBE to appropriate tubes. 

30 Add 200 ul of MUG without CBE to all the tubes with a number only (-CBE). 

Add 200 ul of MUG + CBE to all the tubes. 
Place a plastic cap over each tube. Vortex sample and place back on ice. 

13. Transfer all tubes to 37°C H2O bath. Incubate at 37°C for 15 minutes with 
shaking. Set the shaker between the # 4 - 6 settings, 

35 14. Transfer tubes to ice bath. Quickly add 1 ml of stopping solution to each sample. 

Remove rack of samples from ice. 
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15. You are now ready to read samples in fluorometep. Refer to SOP GCB37 for 
Fluorometer Instructions. 

16. Be certain to put away all reagents under proper storage conditions after reading 

assays. Any left over fluorescent substrates should be returned to -20°C. The fluorescent 
5 substrate and CBE stock can be frozen and thawed numerous times without any breakdown of 
the reagents. Do not save any substrate or assay buffer to which you added CBE. These 
should be discarded. You should only make up enough reagent with the inhibitor (CBE) that 
you currently need. 

1 0 ' Dilution Of Samples for Assays 

Routine dilutions of samples should be carried out on ice using the GCB Assay 
Buffer described above to maintain enzyme activity. Generally a 1:5 or 1:10 dilution of 1 the 
sample is sufficient. Dilutions should be carried out in a microfuge tube. ( Example: 1:10 
dilution: 5 ul of sample in 45 ul of assay buffer, mix well and pulse sample in microfuge to 

1 5 bring all of the sample to the bottom of the tube). You should only need 5 - 1 0 ul of your 
sample for the dilutions. 

EXAMPLE 2 

Extraction of Glucocerebrosidase protein 

20 Glucocerebrosidase (GCB), either derived from human placental tissue or a 

recombinant form from Chinese hamster ovary cells (CHO), is presently used in an effective . 
but costly treatment of the heritable metabolic storage disorder known as Gaucher disease. 
We combined a dual promoter from Cauliflower Mosaic Virus (35S), a translational enhancer 
from Tobacco Etch Virus and a polyadenylation region from the nopaline synthetase gene of 

25 Agrobacterium tumefaciens with the native human GCB cDNA to create plasmid pBSG638. 
These expression elements are widely used to provide the highest possible constitutive 
expression of nuclear encoded genes in plants. The CaMV promoter is further inducible by 
stress or wound treatment. 

Using a standard Agrobacterium-mediated transformation method, we regenerated 93 

30 independent kanamycin-resistant transformants from leaf discs of four different tobacco 

cultivars (the TO generation). In Western blots of total protein extracts, cross-reacting antigen 
was detected in 46 of these TO individuals with antibody raised against human 
glucocerebrosidase. Specificity of the plant-expressed recombinant enzyme was confirmed 
by hydrolysis of 14C-radiolabeled glucosylceramide. According to these expression results 

35 the rGCB positive transformants were ranked into moderate (A), low (B) and negligible (C) 
activity groups. 
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We also found reaction conditions to preferentially inhibit rGCB enzyme activity in 
the presence of plant glucosidases using the suicide substrate conduritol B-epoxide (CBE). 
Total glucosidase activity, and rGCB activity were measured by hydrolysis of the fluorescent 
substrate 4-methylumbelliferylglucopyranoside (4-MUG) with and without CBE. Leaves 
5 from plants transfected with the vector TT01 A 103L were removed at the petiole and slit 
down the midrib into two equal halves. To obtain a total cellular homogenate, one group of 
half-leaves was ground in the presence of 4 volumes of detergent extraction buffer (100 mM 
potassium phosphate p(I 6, 5 mM EDTA, 10 mM, B-mercaptoethanol and 0.5% w/v sodium 
taurocholate) with a mortar and pestle after freezing the tissue in liquid nitrogen. To recover 

10 'the intercellular fluid (IF), the same enzyme extraction buffer was infiltrated into the opposing 
group of half-leaves by submerging the tissue and applying* moderate vacuum pressure (500 
mm Hg). After draining off excess buffer, the undisrupted half-leaves were rolled gently in 
Parafilra, placed in disposable tubes and the intercellular fluid (IF) was collected by low- 
speed centrifugation (1,000 g). The weight of buffer recovered from the infiltrated leaf tissue 

1 5 is recorded and varies from approximately one-half to equal the original weight of the leaf. 
GCB expression in IF extracts was quantified using a commercially available enzyme assay 
reagents and protocol. Total protein was determined by the method described by Bradford 
Bradford, M. Anal. Biochem 72:248 1976. 

We have demonstrated that active recombinant GCB may be successfully extracted 

20 from the intercellular fluid of plant leaves using the present method. The GCB assay is based 
on MUG hydrolysis in the presence of CBE. The IF method results in a recovery of 22% of 
the total GCB activity of the leaf at a 18-fold enrichment relative to an extract obtained by 
homogenization (H). The GCB production results may be improved by optimizing the time 
post-inoculation with the viral vector and minimizing the contaminating viral coat protein 

25 from the intercellular fraction. 



EXAMPLE 3 

Laboratory Pilot Scale Purification of Glucocerebrosidase from the Intercellular Fluid of 
Tobacco 

30 MD609 leaf tissue (1-2 kilograms) of transgenic tobacco expressing the lysosomal 

enzyme glucocerebrosidase was harvested, the mid vein removed and the tissue weighed. 
Tissue was submerged with 2-4 volumes of buffer (0. 1 M KPO4 buffer, pH 6.0, 5 mM 
EDTA, 0.5% taurocholic acid, 10 mM fl-mercaptoethanol) in an infiltration vessel that 
accommodates several kilograms of leaf tissue at one time. A perforated metal plate was 

3 5 placed on top of tissue to weigh down the tissue. A vacuum of 25-27 in. Hg was applied for 
1-2 minutes X3. The vacuum was released between subsequent applications. Tissue was 
rotated and the vacuum reapplied to achieve complete infiltration. Multiple applications of 
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the vacuum without isolating the intercellular fluid constitutes* single infiltration procedure. 
An indication of complete infiltration is a distinct darkening in color of the underside of the 
leaf tissue. Excess buffer on the tissue was drained. The intercellular fluid was released from 
the tissue by centrifuging the tissue in a basket rotor at 4200 RPM (2500 x g) for 10 minutes. 
5 The intercellular fluid was collected using an aspirator hooked up to a vacuum pump (IF-1). 
Alternatively, the leaf tissue can be re-infiltrated by placing the leaves back in the infiltration 
vessel in the same buffer used above and the process repeated (IF-2). The second infiltration 
does not require as many applications of the vacuum. Additionally, the buffer may be drained 
from the infiltration vessel (spent buffer) and pooled with the 1st and 2nd IF fractions. 

1 0 . Collectively, IF- 1 , IF-2 and Spent Buffer constitutes the IF pool. The volume of intercellular 
fluid collected from the infiltrated leaf tissue was between 50 - 100% of the leaf tissue by 
weight depending on the number of infiltrations carried out 

Recombinant GCB was purified by loading the dilute intercellular (feed stream) 
directly on a Pharmacia Streamline 25 column containing Phenyl Streamline resin. Expanded 

1 5 bed chromatography enabled us to capture, clarify and concentrate our protein in one step 

without the need for ccntrifugation and/or microfiltration steps. The column was equilibrated 
and washed until UV-signal on recorder returned to baseline with 25 mM citrate, 20% 
ethylene glycol, pH 5.0 and then eluted with 25 mM citrate, 70% ethylene glycol. The eluted 
material was further purified on a cation exchange resin, SP Big Beads (Pharmacia), 

20 equilibrated in 25 mM citrate, 75 mM NaC 1 , pH 5 .0. GCB was eluted with either a step 

gradient of 25 mM citrate, 0.5 M NaCl, 10% ethylene glycol, pH 5.0 or a linear gradient of 
75 mM - 0,4 M NaCl in 25 mM citrate,.pH 5.0. All chromatography steps were carried out 
at room temperature. 

Using the suicide substrate, conduritol B-epoxide (CBE), inhibition of recombinant 
25 glucocerebrosidase (rGCB) activity in the presence of plant glucosidases was achieved. 
Enzyme activity was measured at 37°C in a reaction mixture containing 5 mM 
methylumbelliferyl B-D glucoside, 0.1 M Potassium Phosphate, 0.15% Triton-XlOO, 0.125% 
sodium taurocholate, 0.1% bovine serum albumin, pH 5.9 with and without CBE. Total 
glucosidase activity and rGCB activity were measured by hydrolysis of the fluorescent 
30 substrate 4-methylumbelliferyl glucopyranoside. One unit of activity is defined as the amount 
of enzyme required to catalyze the hydrolysis of 1 nmol of substrate per hour. Total protein 
was determined using the Bio-Rad Protein Assay based on the method of Bradford (Bradford, 
M. Anal. Biochem. 72:248; 1976). 

Typically from 1 kilogram of leaves where IF-1 alone was collected we obtained 4 
3 5 million unites of GCB at a specific activity of 20,000. The Units /kg increased to 6 million 
with a lower specific activity of 10,000 when IF Pool was collected (IF-1, IF-2 and spent 
buffer). For more information on these experiments, see U.S. Serial Nos. 09/132,989 and 
09/500,554. The disclosures of which are incorporated herein by reference. 
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EXAMPLE 4 

Ultrafiltration/Concentration of Intercellular Fluid from Tobacco Expressing 
Glucocerebrosidase 

5 2.3 kilograms of MD609 leaf tissue from transgenic tobacco expressing the lysosomal 

enzyme glucocerebrosidase was harvested, the mid vein removed and the tissue weighed. 
Tissue was submerged with 2-4 volumes of buffer (0.1 M KPO4 buffer, pH 6.0, 5 mM 
EDTA, 0.5% taurocholic acid, 10 mM B-mercaptoethanol) in an infiltration vessel that 
accommodates several kilograms of leaf tissue at one time. A perforated metal plate was 

10 ' placed on top of tissue to weigh down the tissue. A vacuum of 25-27 in. Hg was applied for 
1-2 minutes X 3. The vacuum was released between subsequent applications. Tissue was 
rotated and the vacuum reapplied to achieve complete infiltration. Excess buffer on the tissue 
was drained. The intercellular fluid was released from the tissue by centrifuging the tissue in 
a basket rotor at 4200 RPM (2500 x g) for 10 minutes. The intercellular fluid was collected 

1 5 using an aspirator hooked up to a vacuum pump (IF-1). The leaf tissue was re-infiltrated by 
placing the leaves back in the infiltration vessel in the same buffer used above and the process 
repeated (IF-2). The buffer was drained from the infiltration vessel (spent buffer) and pooled 
with the 1st and 2nd IF fractions. Collectively, IF-1, IF-2 and Spent Buffer constitutes the IF 
pool. The IF pool was filtered through Miracloth and then concentrated 6 fold by passing the 

20 IF pool through a 1 sq. ft. spiral membrane (30K molecular weight cutoff) using an Amicon 
RA 2000 concentrator equipped with an LP-1 pump. 

Using the suicide substrate, conduritol B-epoxide (CBE), inhibition of recombinant 
glucocerebrosidase (rGCB) activity in the presence of plant glucosidases was achieved. 
Enzyme activity was measured at 37°C in. a reaction mixture containing 5 mM 

25 methylumbelliferyl B-D glucoside, 0.1 M Potassium Phosphate, 0.15% Triton-XlOO, 0.125% 
sodium taurocholate, 0.1% bovine serum albumin, pH 5.9 with and without CBE. Total 
glucosidase activity and rGCB activity were measured by hydrolysis of the fluorescent 
substrate 4-methylumbelliferyl glucopyranoside. One unit of activity is defined as the amount 
of enzyme required to catalyze the hydrolysis of 1 nmol of substrate per hour. Total protein 

30 was determined using the Bio-Rad Protein Assay based on the method of Bradford (Bradford, 
M. Anal. Biochem. 72:248; 1976). 

EXAMPLE 5 

Pilot Scale Purification of Glucocerebrosidase from the Intercellular Fluid of Field Grown 
35 Tobacco 

100 kilograms of MD609 leaf tissue from transgenic tobacco expressing the 
lysosomal enzyme glucocerebrosidase was harvested from the field each day for a period of 
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two weeks. The tissue was stripped off the stalks by hand and*weighed. Five kilograms of 
leaves were placed into polyester bags (Filtra-Spec, 12-2-1053) and four x 5 kg bags of leaves 
were placed into a metal basket. The metal basket containing the leaf material was placed in a 
200 L Mueller vacuum tank containing - 100 liters of buffered solution (0.1 KPO4 buffer, pH 
5 6.0, 5 mM EDTA, 0.5% taurochohc acid, 10 mM fl-mercaptoethanol). A 70 lb. stainless steel 
plate was placed over the leaves/bags to assure complete immersion. A vacuum was pulled 
27 in. Hg, held for 1 minute and then rapidly released. This vacuum infiltration was repeated 
for a total of two cycles. Multiple applications of the vacuum without isolating the 
intercellular fluid constitutes a single infiltration procedure. An indication of complete 

1 0 ' infiltration is a distinct darkening in color of the underside of the leaf tissue. Following the 
vacuum infiltrations, the leaves and basket were removed from the vacuum tank. The bags 
containing the vacuum infiltrated leaves were allowed to gravity drain surface buffer for - 10 
minutes, prior to centrifugation. The intercellular fluid (IF) was recovered from the vacuum 
infiltrated leaves by centrifugation (1,800 x g, 30 minutes) using a Heine basket centrifuge 

15 (bowl dimensions, 28.0 inches diameter x 16.5 inches). Collected IF was filtered through a 
50 uM cartridge filter and then stored at 4°C, until the entire 100 kilograms of tissue was 
infiltrated. This process was repeated with the next set of four 5 kg bags (5 x 20 Kg cycles 
total) until all the tissue was infiltrated. Additional buffer was added during each infiltration 
cycle to completely immerse the tissue. Alternatively, the leaf tissue can be re-infiltrated by 

20 placing the leaves back in the infiltration vessel in the same buffer used above and the process 
repeated (IF-2). Additionally, the buffer may be drained from the infiltration vessel (spent 
buffer) and may be pooled with the 1st and 2nd IF fractions. Collectively, IF-1, IF-2 and 
Spent Buffer constitutes the IF pool. The volume of intercellular fluid collected from the 
infiltrated leaf tissue was between 42-170% of the leaf tissue by weight depending on the 

25 number of infiltrations carried out. 

Recombinant GCB was purified by loading the dilute intercellular (feed stream) 
directly on a Pharmacia Streamline 200 column containing Phenyl Streamline resin. 
Expanded bed chromatography enabled us to capture, clarify and concentrate our protein in 
one step without the need for centrifugation and/or microfiltration steps. The column was 

30 equilibrated and washed until UV-signal on recorder returned to baseline with 25 mM citrate, 
20% ethylene glycol, pH 5.0 and then eluted with 25 mM citrate, 70% ethylene glycol. The 
eluted material was sterile filtered by passing the eluted material through a 1 sq. ft. 0.8 urn 
Sartoclean GF capsule followed by a 1 sq. ft. 0.2 urn Sartobran P sterile filter (Sartorius, 
Corp.) and stored at 4°C until the next chromatography step. The eluted material from 4-5 

35 days of Phenyl Streamline chromatography runs was pooled together and further purified on a 
canon exchange resin, SP Big Beads (Pharmacia), equilibrated in 25 mM citrate, 75 mM 
NaCl, pH 5.0. GCB was eluted with a step gradient of 25 mM citrate, 0.4 M NaCl, 10 % 
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ethylene glycol, pH 5.0. All chromatography steps were carried out at room temperature. 
The eluted material was sterile filtered by passing the eluted material through a 1 sq. ft 0.8 
urn Sartoclean GF capsule followed by a I sq. ft. 0.2 urn Sartobran P sterile filter (Sartorius, 
Corp.) and stored at 4°C. 
5 Using the suicide substrate, conduritol B-epoxide (CBE), inhibition of recombinant 

glucocerebrosidase (rGCB) activity in the presence of plant glucosidases was achieved. 
Enzyme activity was measured at 37°C in a reaction mixture containing 5 mM 
methylumbelliferyl B-D glucoside, 0.1 M Potassium Phosphate, 0.15% Triton-XlOO, 0.125% 
sodium taurocholate, 0.1% bovine serum albumin, pH 5.9 with and without CBE. Total 
1 0 . glucosidase activity and rGCB activity were measured by hydrolysis of the fluorescent 

substrate 4-methylumbelliferyl glucopyranoside. Total protein was determined using the Bio- 
Rad Protein Assay based on the method of Bradford (Bradford, M. Anal. Biochem. 72:248; 
1975). 

Typically from 1 kilogram of field grown tobacco, expressing GCB, where IF-1 alone 
1 5 was collected we obtained 435,000 units of GCB at a specific activity of 2,745. The Units /kg 
increased to 755,000 with a specific activity of 3,400 when IF Pool was collected (IF-1, IF-2 
and spent buffer). 

EXAMPLE 6 

20 Total GCB (IF vs. Homogenate) in "GCB Field Test Virus" 

100 kilograms of MD609 leaf tissue from transgenic tobacco expressing the 
lysosomal enzyme glucocerebrosidase was harvested from the field each day for a period of 
two weeks. The tissue was stripped off the stalks by hand and weighed. Five kilograms of 
leaves were placed into polyester bags (Filtra-Spec, 12-2-1053) and four x 5 kg bags of leaves 

25 were placed into a metal basket. The metal basket containing the leaf material was placed in a 
200 L Mueller vacuum tank containing ~ 100 liters of buffered solution (0.1 KPO4 buffer, pH 
6.0, 5 mM EDTA, 0.5% taurocholic acid, 10 mM fl-mercaptoethanol). A 70 lb. stainless steel 
plate was placed over the leaves/bags to assure complete immersion. A vacuum was pulled 
27 in. Hg, held for 1 minute and then rapidly released. This vacuum infiltration was repeated 

30 for a total of two cycles. Following the vacuum infiltrations, the leaves and basket were 
removed from the vacuum tank. The bags containing the vacuum infiltrated leaves were 
allowed to gravity drain surface buffer for ~ 10 minutes, prior to centrifugation. The 
intercellular fluid (IF) was recovered from the vacuum infiltrated leaves by centrifugation 
(1,800 x g, 30 minutes) using a Heine basket centrifuge (bowl dimensions, 28.0 inches 

35 diameter x 16.5 inches). Collected IF was filtered through a 50 uM cartridge filter and then 
stored at 4°C, until the entire 100 kilograms of tissue was infiltrated. This process was 
repeated with the next set of four 5 kg bags (5 x 20 Kg cycles total) until all the tissue was 
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infiltrated. Additional buffer was added during each infiltration cycle to completely immerse 
the tissue. In order to evaluate how much enzyme was recovered in the intercellular fluid, the 
tissue from which the intercellular fluid was isolated was then homogenized in a Waring 
blender with 4 volumes of the same infiltration buffer as above, centrifuged and the 
5 supernatant assayed for enzyme activity. 

EXAMPLE 7 
Chops Experiment 

An experiment was carried out where 1 00 kilograms of MD609 leaf tissue of 

10 • transgenic tobacco expressing the lysosomal enzyme glucoccrebrosidase was harvested off 
the stalks by hand, weighed and chopped into small pieces to increase the surface area for 
buffer infiltration. Five kilograms of leaves were placed into polyester bags (Filtra-Spec, 12- 
2-1053) and four x 5 kg bags of leaves were placed into a metal basket. The metal basket 
containing the leaf material was placed in a 200 L Mueller vacuum tank containing ~ 100 

1 5 liters of buffered solution (0.1 KPO4 buffer, pH 6.0, 5 mM EDTA, 0.5% taurocholic acid, 10 
mM 6-mercaptoethanol). A 70 lb. stainless steel plate was placed over the leaves/bags to 
assure complete immersion. A vacuum was pulled 27 in. Hg, held for 1 minute and then 
rapidly released. This vacuum infiltration was repeated for a total of two cycles. Following 
the vacuum infiltrations, the leaves and basket were removed from the vacuum tank. The 

20 bags containing the vacuum infiltrated leaves were allowed to gravity drain surface buffer for 
- 10 minutes, prior to centrifugation. The intercellular fluid (IF) was recovered from the 
vacuum infiltrated leaves by centrifugation (1,800 x g, 30 minutes) using a Heine basket 
centrifuge (bowl dimensions, 28.0 inches diameter x 16.5 inches). Collected IF was filtered 
through a 50 uM cartridge filter and then stored at 4°C, until the entire 100 kilograms of tissue 

25 was infiltrated. This process was repeated with the next set of four 5 kg bags (5 x 20 Kg 
cycles total) until all the tissue was infiltrated. Additional buffer was added during each 
infiltration cycle to completely immerse the tissue. In order to evaluate how much enzyme 
was recovered in the intercellular fluid, the tissue from which the intercellular fluid was 
isolated was then homogenized in a Waring blender with 4 volumes of the same infiltration 

30 buffer as above, centrifuged and the supernatant assayed for enzyme activity. 

Recombinant GCB was purified by loading the dilute intercellular (feed stream) 
directly on a Pharmacia Streamline 200 column containing Phenyl Streamline resin. The 
column was equilibrated and washed until UV-signal on recorder returned to baseline with 25 
mM citrate, 20% ethylene glycol, pH 5.0 and then eluted with 25 mM citrate, 70% ethylene 

35 glycol. All chromatography steps were carried out at room temperature Table 10 below 
contains data from the chops experiment 



43 



WO 02/08404 



PO7US01/24111 



EXAMPLE 8 

Pilot Scale of Purification of Alpha Galactosidase from the Intercellular Fluid of Nicotiana 
benthamiana 

Young actively growing Nicotiana benthamiana plants were inoculated with 
5 infectious transcripts of a recombinant plant viral construct containing the lysosomal enzyme 
a galactosidase gene. Systemically infected leaf tissue (1-2 kilograms) was harvested from 
Nicotiana benthamiana expressing a galactosidase 14 days post inoculation. The tissue was 
weighed and submerged with 2-4 volumes of buffer (25 mM Bis Tris Propane Buffer, pH 6.0, 
5 mM EDTA, 0.1 M NaCl, 10 mM B-mercaptoethanol) in an infiltration vessel that can 

10 i accommodate several kilograms of leaf tissue at one time. A perforated metal plate was 

placed on top of tissue to weigh down the tissue. A vacuum of 25-27 in. Hg was applied for 
30 seconds and then quickly released. The tissue was rotated and the vacuum reapplied to 
achieve complete infiltration which was confirmed by a distinct darkening in color of the 
underside of the leaf tissue. Excess buffer on the tissue was drained. The intercellular fluid 

1 5 was released from the tissue by centrifuging the tissue in a basket rotor at 3 800 RPM (2 100 x 
g) for 10-15 minutes. The intercellular fluid was collected using an aspirator hooked up to a 
vacuum pump. In some instances only infected leaf tissue was harvested. Alternatively, 
petioles and stems have been harvested along with the leaf tissue for infiltration. The mid 
vein was not removed from the tissue prior to infiltration. 

20 Alpha galactosidase was purified by loading the dilute intercellular (fed stream) 

directly onto a Pharmacia Streamline 25 column containing Butyl Streamline resin. 
Expanded bed chromatography enabled us to capture, clarify and concentrate our protein in 
one step without the need for centrifugation and/or microfiltration steps. The column was 
equilibrated and washed until UV-signal on recorder returned to baseline with 25 mM Bis 

25 Tris Propane, pH 6.0 20% (NH4)2S04 and then eluted with 25 mM Bis Tris Propane, pH 6.0. 
The eluted material was further purified on Hydroxyapatite equilibrated with 1 mM NaP04 
Buffer, 5% glycerol, pH 6.0 and eluted with either a 1-250 rnM NaP04 buffer, 5% glycerol, 
pH 6.0 linear gradient or a step gradient. All chromatography steps were carried out at room 
temperature. 

3 0 Alpha galactosidase activity was measured by hydrolysis of the fluorescent substrate 

4-methylumbelIiferyl a-D galactopyranoside. Enzyme activity was measured at 37°C in a 
reaction mixture containing 5 mM methylumbelliferyi a-D galactopyranoside, 0.1 M 
Potassium Phosphate, 0.15% Triton-XlOO, 0.125% sodium taurocholate, 0.1% bovine serum 
albumin, pH 5.9. Total protein was determined using the Bio-Rad Protein Assay based on the 

35 method of Bradford (Bradford, M. Anal. Biochem. 72: 248; 1976). 
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From 1 kilogram of leaves, we typically obtain between 140 - 160 million units of a 
galactosidase at a specific activity of 800,000 following a single infiltration procedure (IF-1). 
Table 1 1 below contains data that is representative of several experiments. 



5 EXAMPLE 9 

PILOT SCALE PURIFICATION OF GLUCOCEREBROSEDASE FROM THE 
INTERCELLULAR FLUID OF FIELD GROWN TOBACCO 

Transgenic tobacco (MD609) expressing the lysosomal enzyme glucocerebrosidase 
was mechanically inoculated with a tobacco mosaic virus derivative containing a coat protein 

10 - loop fusion, TMV291, (Turpen, et.al., 1995, Bio/Technology 13: 23-57). A total of 100 Kg 
of transgenic, transfected leaf tissue was harvested from the field, five weeks post inoculation. 
The tissue was stripped off the stalks by hand and weighed. Five kilograms of leaves were 
placed into polyester bags (Filtra-Spec, 12-2-1053) and four x 5 kg bags of leaves were placed 
into a metal basket The metal basket containing the leaf material was placed in a 200 L 

1 5 Mueller vacuum tank containing - 1 00 liters of buffered solution (0. 1 KPO4 buffer, pH 6.0, 5 
mM EDTA, 0.5% taurocholic acid, 10 mM 0-mercaptoethanol). A 70 lb. stainless steel plate 
was placed over the leaves/bags to assure complete immersion. A vacuum was pulled 27 in. 
Hg, held for 1 minute and then rapidly released. This vacuum infiltration was repeated for a 
total of two cycles. Multiple applications of the vacuum without isolating the intercellular 

20 fluid constitutes a single infiltration procedure. An indication of complete infiltration is a 
distinct darkening in color of the underside of the leaf tissue. Following the vacuum 
infiltrations, the leaves and basket were removed from the vacuum tank. The bags containing 
the vacuum infiltrated leaves were allowed to gravity drain surface buffer for ~ 10 minutes, 
prior to centrifugation. The intercellular fluid (IF) was recovered from the vacuum infiltrated 

25 leaves by centrifugation ( 1 ,800 x g, 30 minutes) using a Heine basket centrifuge (bowl 

dimensions, 28.0 inches diameter x 16.5 inches). Collected IF was filtered through a 50 uM 
cartridge filter and then stored at 4°C, until the entire 100 kilograms of tissue was infiltrated. 
This process was repeated with the next set of four 5 kg bags (5 x 20 Kg cycles total) until all 
the tissue was infiltrated. Additional buffer was added during each infiltration cycle to 

30 completely immerse the tissue. 

Recombinant GCB was purified by loading the dilute intercellular (feed stream) 
directly on a Pharmacia Streamline 200 column containing Phenyl Streamline resin. 
Expanded bed chromatography enabled us to capture, clarify and concentrate our protein in 
one step without the need for centrifugation and/or microfiltration steps. The column was 

35 equilibrated and washed until UV-signal on recorder returned to baseline with 25 mM citrate, 
20% ethylene glycol, pH 5.0 and then eluted with 25 mM citrate, 70% ethylene glycol. The 
eluted material was sterile filtered by passing the eluted material through a 1 sq. ft. 0.8 urn 
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Sartoclean GF capsule followed by a 1 sq. ft. 0.2 urn Sartobran*P sterile filter (Sartorius, 
Corp.) and stored at 4°C until the next chromatography step. The eluted material from 4-5 
days of Phenyl Streamline chromatography runs was pooled together and further purified on a 
cation exchange resin, SP Big Beads (Pharmacia), equilibrated in 25 mM citrate, 75 mM 
5 NaCl, pH 5.0. GCB was eluted with a step gradient of 25 mM citrate, 0.4 M NaCl, 10 % 
ethylene glycol, pH 5.0. All chromatography steps were carried out at room temperature. 
The eluted material was sterile filtered by passing the eluted material through a I sq. ft. 0.8 
urn Sartoclean GF capsule followed by a 1 sq. ft. 0.2 um Sartobran P sterile filter (Sartorius, 
Corp.) and stored at 4°C. 

10 , Using the suicide substrate, conduritol B-epoxide (CBE), inhibition of recombinant 

glucocerebrosidase (rGCB) activity in the presence of plant glucosidases was achieved. 
Enzyme activity was measured at 37°C in a reaction mixture containing 5 mM 
methylumbelliferyl fl-D glucoside, 0. 1 M Potassium Phosphate, 0.15% Triton-XlOO, 0.125% 
sodium taurocholate, 0. 1% bovine serum albumin, pH 5.9 with and without CBE. Total 

1 5 glucosidase activity and rGCB activity were measured by hydrolysis of the fluorescent 

substrate 4-methylumbeiliferyl glucopyranoside. Total protein was deterrnined using the Bio- 
Rad Protein Assay based on the method of Bradford (Bradford, M. Anal. Biochem. 72:248; 
1976). Table 12 contains the GCB recovery data from TMV transfected plant tissue. 
The quantity of virus present in IF extracted leaf tissue was determined using 

20 homogenization and polyethylene glycol precipitation methods. In addition, the amount of 
virus present in the pooled, intercellular fluid was determined by direct polyethylene glycol 
precipitation. Final virus yields from precipitated samples was determined 
spectrophotometrically by absorbance at 260 nm. 

25 Table 6 



Sample 


Virus Titer 


IF extracted leaf tissue 


0.206 mg virus/g fresh weight 


Pooled IF 


0.004 mg virus/g fresh weight, 0.010 mg virus/ml IF 



EXAMPLE 10 

MAKING rGAL-A ENZYME 
30 Experimental Results. Achieving high steady-state mRNA levels is a prerequisite for 

vector development. However, there are many complex biochemical and host compatibility 
interactions that ultimately determine the overall performance of a heterologous expression 
system for a given protein. For this reason, we initiated some preliminary experiments to test 
the potential for RNA-viral mediated synthesis of active rGal-A in whole plants. 
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In order to ensure efficient delivery of rGal-A into the lumen of the plant endoplasmic 
reticulum, we fused the Gal-A cDNA (3 1) to a plant signal peptide sequence derived from 
rice cc-amylase gene (32,33). We also hypothesized that addition of an ER-retention signal 
5 (SEKDEL) might prolong the resident time of the recombinant protein in the ER to increase 
the fraction of correctly assembled and catalytically active enzyme under extreme conditions 
of protein synthesis. These constructs were subcloned into the viral vector TTODA, a 
chimera between tobacco and tomato mosaic viruses (Fig. 1). Transcripts were prepared in 
vitro and inoculated onto the lower leaves of whole plants (Nicotiana benthamiana). 1-3 
10 « weeks after inoculation, leaves were weighed, rolled in a strip of Parafilm and placed in a 
disposable chromatography column and submerged in enzyme extraction buffer (0.1 M 
K/P04, 0. 1 M NaCl, 5 mM EDTA, 10 mM 0-ME and 0.5% sodium taurocholate, pH 6.0). In 
order to infiltrate the buffer into the tissue, a vacuum of 730-750 mmHg was twice applied. 
After draining the excess buffer, the intercellular fluid fraction was recovered by low-speed 
1 5 centrifugation (-1 ,500 X g, 1 5 min). To measure enzyme remaining in the tissue after this 
treatment, the leaf was unrolled after centrifugation and two discs removed with a #14 cork 
borer. This tissue sample was transferred to an eppendorf tube, frozen in liquid nitrogen and 
ground in four volumes of enzyme extraction buffer. In rGal-A enzyme assays, we measured 
cleavage of the fluorogenic substrate 4-methyl umbeliferyl ct-D-galactopyranoside (4-MUG) 
20 against known standards using established protocols (34). Units are nmoles of 4-MUG 
hydrolyzed per hour at 37*C, 

In several initial experiments, plant leaves transfected with all constructs accumulated 
1-2% of the total soluble plant protein as cross reacting immunologic material (CRhM) using 
antisera specific for Gal-A in quantitative Western analyses (data not shown). However, 
25 enzyme activity was much lower than expected for this amount of CRIM and furthermore was 
only 2-4 fold higher than activity due to endogenous plant cc-galactosidase isozymes (See 
Table 1: Experiment 10719/96). It also appeared that addition of the ER retention signal 
allowed highest accumulation of steady state activity and that the IF contained little if any 
additional activity or CRIM. There are three cellular fates for any glycoprotein synthesized in 
30 a plant leaf: secretion to the IF, retention in the ER or sorting to the vacuole (35). We 

reasoned that because the ER retention signal slightly increased expression, the majority of 
the enzyme was inactivated later in the secretory pathway. This could most likely occur by 
aggregation in the trans-golgi network as is reported during over-production of this enzyme in 
CHO-cells (36), and/or in the plant leaf vacuole. The IF fraction is quite clear and non- 
35 pigmented and is suitable for direct chromatography. Using the initial construct (rGal- 

SEKDEL) we partially purified small amounts of rGal-A from the IF on hydrophobic, lectin 
and size exclusion resins. 
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For several plant proteins vacuolar sorting information is located in a carboxy- 
terminal propeptide (CTPP; 37,38). During the original cloning and characterization of 
human Gal-A, Quinn et ah, postulated a cathepsin-Iike potential CTPP cleavage for this 
enzyme at or near two arginine residues, 26 and 28 AA from the termination codon (39,40). 
5 The precise AA sequence at the carboxy terminus has, to our knowledge, never been reported 
Because secretion in the plant leaf is through a default pathway we reasoned that deletion of 
specific sorting information from a postulated CTPP might yield more active enzyme in the 
IF. Analysis of a second set of constructs containing either 12 or 25 AA truncations, with and 
without the ER retention signal provided dramatic evidence for the significance of this region 
10 . (See Table 13). In one construct, rGal 12-SEKDEL, virtually all of the CRIM is now 

assembled and stored as fully active enzyme and is secreted to the IF in significant quantities. 
As demonstrated in Fig. 3, rGal-A (-52 kDa) is now the most abundant plant protein in a 
crude leaf IF sample. The other predominant band at 17.5 kDa is the viral structural protein 
which likely contaminates this fraction from broken trichomes of the leaf surface. 

15 

Table 1 3 . rGal-A Expression (U/Gram Leaf Tissue) 
Contract/Sample Intercellular Residual Total 



Fluid Homogenate 



Experiment 








Uninfected Plant 


2,800 


7,500 


10,300 


rGal-A 


5,100 


10,900 


25,000 


rGal-A 


5,400 


15,000 


20,400 


rGal-A-SEKDEL 


6,800 


30,300 


37,100 


rGal-A- SEKDEL 


5,200 


34,500 


39,700 


Experiment 








Uninfected Plant 


2,300 


4,800 


7,100 


rGal-A 25 


4,000 


8,900 


12,900 


rGal-A 25 


2,300 


9,000 


11,300 


rGal-A 25 -SEKDEL 


3,200 


10,000 


13,200 


rGal-A 25-SEKDEL 


2,800 


8,600 


11,400 


rGal-A 12 


5,500 


11,700 


17,200 


rGal-A 12-SEKDEL 


109,800 


117,700 


227,500 


rGal-A 12-SEKDEL 


199,000 


329,500 


528,500 
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Example 1 1 

a-GALACTOSIDASE EXPRESSION VECTOR DEVELOPMENT, CONSTRUCTION 
AND TESTING 

5 The following example describes the series of a-galactosidase vectors that were constructed 

and tested for enzyme production. Initially, the human a-galactosidase A cDNA (Sugimoto, Y., 
Aksentijevich, L, Murray, G. J., Brady, R. O., Pastan, L, and Gottesman, M. M. Retroviral 
coexpression of a multidrug resistance gene (MDRI) and human a-galactosidase A for gene therapy 
of Fabry disease. Human Gene Therapy 6:905, 1995.)was fused to a plant signal peptide sequence 

1 0 1 derived from a rice a-amylase gene (Kumagai, M. H., Shah, M., Terashima, M, Vrkljan, Z., 

Whitaker, J. R., and Rodriguez, R. L. Expression and secretion of rice a-amylase by Saccharomyces 
cerevisiae. Gene 94:209, 1990.). This chimeric gene was subcloned into the TMV based expression 
vector TTODA resulting in a construct designated rGAL-A , see Table 1 . Vector rGAL-A was 
modified by the addition of the putative ER retention signal SEKDEL, resulting in the vector 

15 designated rGAL-AR, see Table 1. 

A series of C-terminal amino acid deletions were introduced into the a-galactosidase gene. 
Deletions of 4, 8, 12 or 25 codons from the C-terminus of rGAL-A were generated as well as the 
addition of the putative ER retention sequence (SEKDEL), see Table 1 and Figure 1 (sequence of 
TTODA (rGAL-12R). The deletion vectors were designated as described in Table 8: 
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Table 8. 



v cvLUi L/C alexia Hull 


Pnrhmrv-TpTmfnnl Modi fi cations ( Amino Acid Seouence^ 








TSRLRSHT^rP^XrTVLLOLE^^^MOMSLKDLL 


rGAL-AR 


TSRLRSHINPTGTVLLQLENTKtQMSLKDLLSEKDEL 


rGAL-4 


TSRLRSHINPTGTVLLQLENTMQMSL 


rGAL-4R 


TSRLRSHINPTGTVLLQLENTMQMSLSEKDEL 


rGAL-8 


TSRLRSHIKPTGTVLLQLENTM 


IGAL-8R 


TSRLRSHINPTGTVLLQLENTMSEKDEL 


rGAL-12 


TSRLRSHINPTGTVLLQL 


rGAL-12R 


TSRLRSfflNFTGTVLLQLSEKDEL 


rGAL-25 


TSRLR 


rGAL-25R 


TSRLRSEKDEL 



The a-galactosidase gene fragment present in vector rGAL-12R was placed into TMV vector 
SBS5. In addition, the rice a-amylase signal peptide present in KJAL-12R was replaced by the native 
5 human a-galactosidase signal peptide. The resultant vector designated SBS5-rGAL- 12R , see Figure 
2., exhibited genetic stability upon serial passage on N. benthamiana plants. 

a-galactosidase was extracted from inoculated plants using interstitial fluid and 
homogenization methods. Fluids were analyzed for a-galactosidase yield, enzyme activity and 
cellular partitioning and targeting, see Table 15. In all cases, infectious transcripts were prepared in 

1 0 vitro and inoculated onto the lower leaves of actively growing Nicotiana benthamiana plants. 
Characteristic viral symptoms, vein clearing and leaf curling, were noted -6-8 dpi (days post 
inoculation). Tissue samples were obtained from infected plants 1-3 weeks after inoculation. Leaves 
were weighed, rolled in a strip of Parafilm and placed in a disposable chromatography column and 
submerged in extraction buffer (0.1 M K/P04, 0.1 M NaCl, 5 mM EDTA, 10 mM [3-ME and 0.5% 

1 5 sodium taurocholate, pH 6.0). Extraction buffer was infiltrated into the tissue by pumping a vacuum 
of 730-750 rarnHg. The vacuum was applied and released two times. After draining the excess buffer, 
the interstitial fluid (IF) fraction was recovered by low-speed centrifugation (-1,500 X g, 15 min). To 
measure enzyme remaining in the tissue after this treatment, the leaf was unrolled after centrifugation 
and two discs (-1 mg each) removed with a #14 cork borer. This tissue sample was transferred to an 

20 Eppendorf tube, frozen in liquid nitrogen and ground in four volumes of extraction buffer. The total 
homogenate was then subjected to centrifugation at ~5,000xg and the supernatant fraction was saved 
for further analysis. 

Extracts from IF and homogenates from post-IF leaf tissue were analyzed for enzymatic 
activity by the hydrolysis of the fluorescent substrate 4-methyl umbeliferyl a-D-galactopyranoside (4- 
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MUG). Known standards and established protocols (Suzuki, R. Enzymatic diagnosis of 
sphingolipidoses. Meth. Enzy. 138:727, 1987.) were used to obtain the number of enzymatic units 
(nmoles of 4-MUG hydrolyzed per hour at 37"C) per gram fresh weight of tissue harvested. 



5 Table 9. 



Vector 


Interstitial Fluid 


Homogenate 


Total Enzyme 


Ratio of Activity 


Designation 


Units/gram leaf 


Units/gram leaf 


Activity 


IF/Homogenate 








Units/gram leaf 














Uninfected 


3,837 


9,404 


13,241 


0.41 


rGAL-A 


6,833 


189,971 


196,804 


0.04 


rGAL-AR 


6,829 


312,068 


318,897 


0.02 


rGAL-4 


16,088 


262,806 


278,894 


0.06 


rGAL-4R 


8,245 


357.414 


365,659 


0.02 


rGAL-8 


261,814 


524,857 


789,671 


0.50 


rGAL-8R 


10,628 


469,956 


480,584 


0.02 


rGAL-12 


2,564 


8,743 


11,307 


0.29 


K5AL-12R 


305,803 


1,033,921 


1,339,724 


0.30 


rGAL-25 


1,265 


6,629 


7,894 


0.19 


rGAL-25R 


2,489 


6,394 


8,883 


0.39 



Enzyme activity data from IF and homogenates derived from plants expressing a- 
galactosidase from the vectors in Table 2. indicate that carboxy-termmal deletions (4-12 
codons) results in increased ct-galactosidase expression. Vector rGAL-12R expressed the 
1 0 highest level of total o>galactosidase and also secreted the highest quantity of active enzyme. 

EXAMPLE 12 

PILOT SCALE PURIFICATION OF a-GALACTOSEDASE 

1 5 Actively growing Nicotiana benthamiana plants, propagated in an uncontrolled horticultural 

greenhouse, were inoculated with encapsidated transcripts derived from the expression vector, SBS5- 
rGAL-12R. Tissue was harvested 14-17 days post inoculation. . Five kilograms of leaves were 
placed into polyester bags (Filtra-Spec®, 12-2-1053) and four x 5 kg bags of leaves were placed into a 
metal basket The metal basket containing the leaf material was placed in a 200 liter Mueller® 

20 vacuum tank containing ~ 100 liters of buffered solution (50 mM acetate, 5 mM EDTA, 10 mM 2- 
mercaptoethanol, pH5.0). A 70 lb. stainless steel plate was placed over the leaves/bags to assure 
complete immersion. A vacuum was pumped to 695 mm Hg, held for 1 minute and then rapidly 
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released. This vacuum infiltration was repeated for a total of tivo cycles. Multiple applications of the 
vacuum without isolating the interstitial fluid constitutes a single infiltration procedure. An indication 
of complete infiltration is a distinct darkening in color of the underside of the leaf tissue. Following 
the vacuum infiltrations, the leaves and basket were removed from the vacuum tank. The bags 
containing the vacuum infiltrated leaves were allowed to gravity drain surface buffer for - 10 minutes, 
prior to centrifugation. The interstitial fluid (IF) was recovered from the vacuum infiltrated leaves by 
centrifugation (1,800 x G, 30 minutes) using a Heine® basket centrifuge (bowl dimensions, 28.0 
inches diameter x 16.5 inches). The IF was filtered through a 50 um cartridge filter to remove plant 
debris prior to purification. 

Ammonium sulfate was added to the IF to 15% saturation, mixed for 10 minutes and loaded 
onto a Pharmacia Streamline 200 column containing 4 liters of Butyl Streamline resin equilibrated 
with 25 mM Imidizole, 15% (NHO2SO4, pH 6.0 at 1.2 Uminv The column was washed to UV 
baseline with 25 mM Imidizole, pH 6.0, 15% (NH^SC^ and a Gal was eluted with a step gradient of 
25 mM Imidizole, pH 6.0. The eluent was filtered through a Sartorius glass fiber -> 0.8 um cartridge 
filter and loaded directly onto 3 liters of Blue Sepharose in a Pharmacia BPG 200 column equilibrated 
with 25 mM Imidizole, pH 6.0. The column was washed to UV baseline with 25 mM Imidizole, pH 
6.0 and a gal was eluted with a step gradient of 25 mM Imidizole, 650 mM NaCl, pH 6.0. The eluent 
was concentrated using a 10 kD MWCO, cellulose acetate, 3 ft 2 spiral membrane in an Amicon CH-2 
concentrator and then sterile filtered. 

Further purification was carried out either on Octyl Sepharose FF or Hydroxyapatite. For 
Octyl Sepharose the column was equilibrated with 25 mM Imidizole, 25% ammonium sulfate, pH 6.0 
and eluted using a linear gradient of 25 - 0% (NH^SC^ in 25 mM Imidizole, pH 6.0. For 
Hydroxyapatite purification, the sample was dialyzed overnight against 10 mM KPC>4Buffer, pH 7.0 
and then loaded on a column was equilibrated with 10 mM KP0 4 Buffer, pH 7.0. The column was 
washed with equilibration buffer until the UV reached baseline, followed by a linear gradient of 10 - 
200 mM KP04Buffer, pH 7.0. The a gal flowed through the column free of the contaminating 
proteins. 

Alpha gal activity was measured throughout the process with a fluorescent 
assay using the synthetic substrate, 4-methylumbelliferyl-a-D-galactopyranoside 
(MU-ot gal). Enzyme activity was measured at 37°C in a reaction mixture containing 
5 rnM methylumbelliferyl a-D-galactopyranoside, 0.1 M Potassium Phosphate, 
0.15% Triton-XlOO, 0.125% sodium taurocholate, 0.1% bovine serum albumin, pH 
5.9. ). One unit of enzymatic activity hydrolyzes l nmol of MU-a-gal per hour at 
37°C. Total protein was determined using the Bio-Rad Protein Assay based on the 
method of Bradford (Bradford, M. Anal. Biochem. 72: 248; 1976). Results of o> 
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galactosidase activity (Total units and specific activity) from different enzyme 
production lots are shown in TablelO. 
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Table 10. 



Lot Number 


Kg Biomass Extracted 


Total Units (IF) 


Specific Activity 








Units/mg protein 
(Purified) 










981215 


44.4 


2.9 x 10* 


5.0x10* 


991115 


100 


5.5 x 10 9 


3.6 xlO 6 


991116 


120 


6.9 x 10 9 


4.0 x 10* 


991117 


120 


5.9 x 10* 


3.5 x 10*" 


991118 


95.6 


7.0 x 10 9 


3.5 xlO 6 ' 



*Butyl eluents from lots 991 1 17 and 991 1 1 8 were pooled before final chromatography and 
5 purification. The data presented in Table 16. demonstrates the consistency of pilot-scale 
extraction with respect to secreted enzyme yield and specific activity. 



Example 13 

Analysis of Purified cc-Galactosidase 

10 N-texminal Sequence Analysis 

N-terminal sequence analysis of a-galactosidase, purified from plants inoculated with 
transcripts derived from the vector K5AL-12R, MLDNGLARTPT, had a 100% sequence homology to 
1 1 amino acids of human placental a-galactosidase with the addition of an N-terminai methionine. In 
contrast, N-terminal sequence of a-galactosidase, purified from plants inooulated with transcripts 

15 derived from the vector SBS5-K3AL-12R, LDNGLARTPT, was as expected from native human 
enzyme. These data indicates the high degree of fidelity that post-translational modifications are 
carried out within plant leaf cells and that human signal peptides are processed with equal specificity 
in plants as in the native mammalian source. 

20 C-Terminal sequence Analysis 

C-Terminal sequence of the rGAL-12R and SBS5-rGAL-12R plant produced enzyme was 
obtained by Edman degradation using the commercial service of the Mayo Foundation. Three cycles 
were achieved before the signal was too low to read additional sequence. Expected C-Terminus: 
LLQLSEKDEL 

25 

Cycle Major amino acids 
1st L,E 
2nd D, V, A 
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3rd Q,G,T 

It is important to note that the C-terminal amino acid was found to be heterogeneous, either L 
or E. The presence of glutamic acid in the first cycle greatly reduced the signal because glutamic acid 
5 can form a cyclic structure during the activation step that disables cleavage from the chain and 
therefore blocks a portion of the sample to further sequencing. This reduced that ability of the 
software to interpret cycle 3 and beyond. However, the presence of L, E and D in the first two cycles 
and the absence of other amino acids present in the analysis in an order resembling the ct- 
galactosidase sequence strongly suggests that a population of the enzyme terminates with a DEL 
1 0 • sequence as expected from the sequence of the DNA clone. 

Molecular weight Determination 

The apparent molecular weight of SBS5-rGAL-12R derived a-galactosidase (-50 kDa) was 
quite similar to human a-galactosidase A, purified from human placenta, as judged by both coomassie 
15 and silver stained SDS-PAGE. However, the protein purified from plant sources showed less 
molecular weight variation than the native human protein, indicating less heterogeneity in plant 
glycosylation or a higher purity plant enzyme preparation. 

The molecular mass of several lots of plant derived a-galactosidase were determined by 
MALDI-TOF mass spectroscopy to be 48,963, 48,913, 49,100 daltons. These weights are consistent 
20 with the predicted mass of a-galactosidase, based upon amino acid sequence, allowing for broader 
peaks due to glycosylation. The calculated molecular weight of SBS5-rGAL-12R derived a- 
galactosidase is 44,619. The difference in predicted and observed mass would equate to 
approximately 10.0% carbohydrate. 

25 Glycan Analysis 

There are four potential N-glycosylation consensus sequences (N-X-T/S) reported for human 
a gal A (Matsuura, et. al. Glycobiology 8:329-339, 1998). We have identified four potential sites 
(108, 161, 184, 377) in our plant expressed a gal. One potential glycosylation site, in our a gal, is not 
glycosylated (377), as is the case for human a gal A expressed in CHO-cells. 

3 0 Plants have both high mannose and complex glycans that differ from mammalian 

complex glycans by the presence of an al,3 fucose on the proximal GlcNac and a pi,2 xylose 
on the p-linked mannose of the core. Four potential N-glycosylation sites have been identified 
for the plant derived a-galactosidase. The predicted amino acid sequence has four possible 
glycosylation sites (Asn-Xaa-Ser/Thr) at Asn residues (108, 161, 184,377). The 

35 glycosylation site at amino acid 377 was not glycosylated, similar to CHO cell derived a- 

galactosidase glycosylation. The four possible N-glycosylation sites are all located in P turns 
within hydrophilic regions of the enzyme. It was estimated the mature human a-galactosidase 
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consists of about 370 amino acids and approx. 15% carbohydrate (Calhoun et al. PNAS 82: 
7364-7368, 1985). Matsuura et al (Glycobioiogy 8:329-339,1998) reports that in CHO-cell 
produced a gal there are four N-glycosylation sites (139, 193, 215, 407) and 3 of the 4 sites 
are occupied (407 is not glycosylated). 
5 We have determined that our plant expressed protein is indeed glycosylated because 

the enzyme will bind to ConA which has a specificity for high mannose structures. Also, the 
plant derived enzyme was chemically deglycosylated with TFMS (trifluoromethanesulfonic 
acid). The a-galactosidase appeared to be cleaved as observed by a shift in molecular weight 
on both a silver stained gel and a Western blot with a gal antibody. Early attempts to cleave 

10 ' rGal-A with PNGaseF to release N-linked carbohydrate have been unsuccessful suggesting 
the presence of al,3 fucose on the terminal GlcNac of the carbohydrate side chain. This was 
verified by glycan analysis work carried out by the Glycobioiogy Core Group at University of 
California San Diego Cancer Center. Carbohydrate profiling and compositional analysis was 
done. NMR experiments confirmed that rGal-A from the plant IF contains an N-linked 

15 glycan containing plant-specific carbohydrate linkages of a pi ,2 xylose and a 1,3 fucose on 
the trimannosyl core. This N-linked structure has been previously reported to occur in 
glycoproteins isolated from plant seeds and tissue cultures. Five (5) ug was hydrolyzed with 
2M TFA for 4 hours and analyzed by HPAEC-P AD. The total amount of sugar and sugar 
content was 560 ug and 12%. NMR analysis of the major peak showed a trimannosyl- 

20 chitobiose core, with a 1 ,3 linked fucose and a pi ,2 linked xylose. 

a-galactosidase glycan structures were determined by MALDI-TOF and/or MALDI- 
MS in collaboration with the Universitaet fuer Bodenkultur, see Table 11. For MALDI, 5 ug 
of plant derived a-galactosidase was digested with pepsin in a mass ratio of 1:40 in 5 % 
formic acid. After evaporation the peptides were dissolved in ammonium acetate buffer, pH 

25 5 .0, boiled and subsequently digested with PNGase A overnight Since the sample has a mass 
of 49.000 g/mo\, there are 100 pmol of glycoprotein. After evaporation, the peptides were 
removed by cation exchange chromatography and the glycans are analyzed by MALDI (or 
pyridylaminated). 

The molecular mass of the glycan was detennined by MALDI-MS using a 
30 ThermoBio Analysis DYNAMO (linear MALDI-TOF MS with delayed extraction) 

instrument, A small portion of the sample was dried on the sample target and subsequently 
overlaid with "matrix" (gentisic acid). The samples contained complex type sugar chains with 
fucose, xylose and varying amounts of terminal GlcNAc. Small fractions were devoid of 
fucose and therefore amenable to hydrolysis by PNGase F. 
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Table 11 



Glycan Structure 


Molecular Weight 


Lot #980805 


Lot #981215 




Daltons 


% Glycan Structure 


% Glycan Structure 










MOXF 


1050.5 


3 




MMX 


1066.5 


2 


2 


MMXF 


1212.7 


22 


8 


Man 5 


1237.5 




1.8 


GnMX/MGnX 


1269.8 


6 


0.5 


GnMXF/MGnXF 


1416.4 


53 


16 


GnGnX 


1473.3 


.5 


7 


GnGnXF 


1619.9 


9 
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Table 12 



Characteristic 


Plant Derived 


CHO Cell Secreted 








Number of Core Structures 


8 


23 


Sialic Acid 


Absent 


Present 


Xylose 


p (1,2) Linkage 


Absent 


Fucose 


a (1,3) Linkage 


a (1,6) Linkage 


% Complex Structures 






% Weight Glycosylated 


10-12% 


15% 


Specific Activity 






Table 13 



a-Galactosidase Source 


Specific Activity 


Reference 




4-MU Substrate 










Nicouana benthamiana 
Human, Recombinant 


5.0 x 10 6 


This Patent Application 


Human Spleen 


1.88 xlO 6 


Bishop and Desnick, 1981, J, Biol. 
Chem. 256(3): 1307-1316 


Human Placenta 


0.99x10* 


Bishop and Desnick, 1981, J. Biol. 
Chem. 256 (3): 1307-1316 


Human Plasma 


7.4 x 10 5 


Bishop and Sweeley, 1978, Biochim. 
Bioph. Acta, 525:399-409 



This example demonstrates the ability to extract two different products from the same 
leaf tissue based upon extraction procedures that specifically target products localized in the 
apoplast and cytosol. 

10 CONCLUSIONS : 

The following includes a number of aspects of the invention: 

1. A method for producing a protein of choice comprising a lysosomal enzyme which is 
enzymatically active, comprising: 



15 recovering the lysosomal enzyme from (i) a transgenic plant cell or (ii) a cell, tissue or organ 
of a transgenic plant, which transgenic plant cell or plant is transformed or transfected with a 
recombinant expression construct comprising a nucleotide sequence encoding the lysosomal 
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enzyme and a promoter that regulates expression of the nucleotide sequence so that the 
lysosomal enzyme is expressed by the transgenic plant celJ or plant. 

2. The method according to claim 1, in which the promoter is an inducible promoter. 

5 

3. The method according to claim 2, in which the inducible promoter is induced by 
mechanical gene activation. 

4. The method according to claim 2, which is carried out with the transgenic plant and 

1 0 , additionally comprises a step of inducing the inducible promoter before or after the transgenic 
plant is harvested, which inducing step is carried out before recovering the lysosomal enzyme 
from the cell, tissue or organ of the transgenic plant. 

5. The method according to claim 1, in which the lysosomal enzyme is a modified lysosomal 
15 enzyme which is enzymatically active and comprises: 

(a) an enzymatically-active fragment of a human or animal lysosomal enzyme; 

(b) the human or animal lysosomal enzyme or (a) having one or more amino acid residues 
20 added to the amino or carboxyl terminus of the human or animal lysosomal enzyme or (a); or 

(c) the human or animal lysosomal enzyme or (a) having one or more naturally-occurring 
amino acid additions, deletions or substitutions. 

25 6. The method according to claim 5, in which the modified lysosomal enzyme comprises a 
signal peptide or detectable marker peptide at the amino or carboxyl tenninal of the modified 
lysosomal enzyme. 

7. The method according to claim 6, in which the modified lysosomal enzyme is recovered 
30 from (i) the transgenic plant cell or (ii) the cell, tissue or organ of the transgenic plant by 

reacting with an antibody that binds the detectable marker peptide. 

8. The method according to claim 7, in which the antibody is a monoclonal antibody. 

35 9. The method according to claim 5, in which the modified lysosomal enzyme comprises: 
(a) an enzymatically-active fragment of an . a.-N-acetylgalactosaminidise, acid lipase, . a.- 
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galactosidase, glucocerebrosidase, . a.-L-iduronidase, iduronate sulfatase, . a.-mannosidase 
or sialidase; 

(b) the a-N-acetylgalactosaminidase, acid lipase, a-galactosidase, glucocerebrosidase, a-L- 
5 iduronidase, iduronate sulfatase, a-mannosidase, sialidase or (a) having one or more amino 
acid residues added to the amino or carboxyl terminus of the a-N-acetylgalactosaminidase, 
acid lipase, a-galactosidase, glucocerebrosidase, a-L-iduronidase, iduronate sulfatase, a- 
mannosidase, sialidase or (a); or 

10 (c) the a-N-acetylgalactosarninidase, acid lipase, a galactosidase, glucocerebrosidase, a-L- 
iduronidase, -iduronate sulfatase, a -mannosidase, sialidase or (a) having one or more 
naturaUy-occurring amino acid additions, deletions or substitutions. 

10. The method according to claim 9, in which the modified lysosomal enzyme comprises: 

15 

(a) an enzymatically-active fragment of a human glucocerebrosidase or human a-L- 
iduronidase enzyme; 

(b) the human glucocerebrosidase, human a-L-iduronidase or (a) having one or more amino 
20 acid residues added to the amino or carboxyl terminus of the human glucocerebrosidase, 

human a-L-iduronidase or (a); or 

(c) the human glucocerebrosidase, human a-L-iduronidase or (a) having one or more 
naturally-occurring amino acid additions, deletions or substitutions. 

25 

1 1. The method according to claim 5, in which the modified lysosomal enzyme is a fusion 
protein comprising: 

(I) (a) the enzymatically-active fragment of the human or animal lysosomal enzyme, 

30 

(b) the human or animal lysosomal enzyme, or 

(c) the human or animal lysosomal enzyme or (a) having one or more naturally-occurring 
amino acid additions, deletions or substitutions, and 

35 

(II) a cleavable linker fused to the amino or carboxyl terminus of (I); and the method 
comprises: 
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(i) recovering the fusion protein from the transgenic plant cell, or the cell, tissue or organ of 
the transgenic plant; 

5 (ii) treating the fusion protein with a substance that cleaves the clcavable linker so that (I) is 
separated from the cleavable linker and any sequence attached thereto; and 

(iii) recovering the separated (I). 

10 i 12. Hie method according to claim 1, in which the transgenic plant is a transgenic tobacco 
plant. 

13. The method according to claim 1, in which the lysosomal enzyme is a human or animal 
lysosomal enzyme. 

15 

14. The method according to claim 13, in which the lysosomal enzyme is an a-N- 
acetylgalactosaminidase, acid lipase, a galactosidase, glucocerebrosidase, a-L-iduronidase, 
iduronate sulfatase, a-mannosidase or sialidese. 

20 15. The method according to claim 1 4, in which the lysosomal enzyme is a human 
glucocerebrosidase or human ct-L-iduronidase. 

16. The method according to claim 1, in which the organ is leaf, stem, root flower, fruit or 
seed. 

25 

17. A recombinant expression construct comprising a nucleotide sequence encoding a protein 
of choice comprising a lysosomal enzyme and a promoter that regulates the expression of the 
nucleotide sequence in a plant cell. 

30 1 8. The recombinant expression construct of claim 1 7, in which the promoter is an inducible 
promoter. 

19. The recombinant expression construct of claim 18, in which the inducible promoter is 
induced by mechanical gene activation. 

35 

20. The recombinant expression construct of claim 17, in which the lysosomal enzyme is a 
modified lysosomal enzyme which is enzymatically active and comprises: 
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(a) an enzymatically-active fragment of a human or animal lysosomal enzyme; 

(b) the human or animal lysosomal enzyme or (a) having one or more amino acid residues 

5 added to the amino or carboxyl terminus of the human or animal lysosomal enzyme or (a); or 

(c) the human or animal lysosomal enzyme or (a) having one or more naturally occurring 
amino acid additions, deletions or substitutions. 

10 i 21. The recombinant expression construct of claim 20, in which the modified lysosomal 
enzyme comprises a signal perude or detectable marker peptide at the amino or carboxyl 
terminal of the modified lysosomal enzyme. 

22. The recombinant expression construct of claim 21, in which the detectable marker peptide 
1 5 comprises SEQ ID NO: 10. 

23. The recombinant expression construct of claim 20, in which the modified lysosomal 
enzyme comprises: 

20 (a) an enzymatically-active fragment of an a-N-acetylgalactosaminidase, acid lipase, a- 

galactosidase, glucocerebrosidase, a-L-iduronidase, iduronate sulfatase, a-mannosidase or 
sialydase; 

(b) the a-N-acetylgalactosaminidase, acid lipase, ct-galactosidase, glucocerebrosidase, a-L- 
25 iduronidase, iduronate sulfatase, a-mannosidase, sialidase or (a) having one or more amino 
acid residues added to the amino or carboxyl terminus of the a-N-acetylgalactosaminidase, 
acid lipase, a-galactosidase, glucocerebrosidase, a-L-iduronidase, iduronate sulfatase, a- 
rnarmosidase, sialidase or (a); or 

30 (c) the a-N-acetylgalactosaminidase, acid lipase, a-galactosidase, glucocerebrosidase, a-L- 
iduronidase, iduronate sulfatase, a-mannosidase, sialidase or (a) having one or more 
naturally-occurring amino acid additions, deletions or substitutions. 

24. The recombinant expression construct of claim 23, in which the modified lysosomal 
35 enzyme comprises: 

(a) an enzymatically-active fragment of a human glucocerebrosidase or human a-L- 
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iduronidase enzyme; , 

(b) the human glucocerebrosidase or human oc-L-iduronidase or (a) having one or more amino 
acid residues added to the amino or carboxyl terminus of the human glucocerebrosidase, 

5 human oc-L-iduronidase or (a); or 

(c) the human glucocerebrosidase, human cc-L-iduronidase or (a) having one or more 
naturally-occurring amino acid additions, deletions or substitutions. 

10 r 25. The expression construct of claim 20, in which the modified lysosomal enzyme is a fusion 
protein comprising 

(T) (a) the enzymatically-active fragment of the human or animal lysosomal enzyme, 

15 (b) the human or animal lysosomal enzyme, or 

(c) the human or animal lysosomal enzyme or (a) having one or more naturaJly-orcurring 
amino acid additions, deletions or substitutions, and 

20 (II) a cleavable linker fused to the amino or carboxyl terminus of (I). 

26. The recombinant expression construct of claim 17, in which the lysosomal enzyme is a 
human or animal lysosomal enzyme. 

25 27. The recombinant expression construct of claim 25, in which the lysosomal enzyme is an 
a-N-acetylgalactosaminidase, acid lipase, a-galactosidase, glucocerebrosidase, oc-L- 
iduronidase, iduronate sulfatase, a-rnannosidase or sialidase. 

28. The recombinant expression construct of claim 27, in which the lysosomal enzyme is a 
30 human glucocerebrosidase or human a-L-iduronidase. 

29. A plant transformation vector comprising the recombinant expression construct of claim 
17, 18, 20, 21, 24, 25, or 28. 

35 30. A plant which is transformed or transfected with the recombinant expression construct of 
claim 17, 18, 20, 2 1,24, 25, or 28. ' 

3 1 . A plant cell, tissue or organ which is transformed or transfected with the recombinant 
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expression construct of claim 17, 18, 20, 21, 24, 25, or 28. 

32. A plant transfection vector comprising the recombinant expression construct of claim 1 7, 
18, 20,21,24, 25, or 28. 

5 

33. A plasmid comprising the recombinant expression construct of claim 17, 18, 20, 21, 24, 
25, or 28. 

34. A transgenic plant or plant cell capable of producing a lysosomal enzyme which is 

1 0 . enzymaticalry active, which transgenic plant or plant cell is transformed or transfected with a 
recombinant expression construct comprising a nucleotide sequence encoding a lysosomal 
enzyme and a promoter that regulates expression of the nucleotide sequence in the transgenic 
plant or plant cell. 

15 35. The transgenic plant or plant cell of claim 34, in which the promoter is an inducible 
promoter. 

36. The transgenic plant or plant cell of claim 35, in which the inducible promoter is induced 
by mechanical gene activation. 

20 

37. The transgenic plant or plant cell of claim 36, in which the inducible promoter comprises 
SEQIDNO:5. 

38. The transgenic plant or plant cell of claim 34, in which the lysosomal enzyme which is a 
25 modified lysosomal enzyme which is enzymatically active and which comprises: 

(a) an enzymatically-active fragment of a human or animal lysosomal enzyme; 

(b) the human or animal lysosomal enzyme or (a) having one or more amino acid residues 
30 added to the amino or carboxyl terminus of the human or animal lysosomal enzyme or (a); or 

(c) the human or animal lysosomal enzyme or (a) having one or more naturally-occurring 
amino acid additions, deletions or substitutions. 

35 39. The transgenic plant or plant cell of claim 38, in which the modified lysosomal enzyme 
comprises a signal peptide or detectable marker peptide at the amino or carboxyl terminal of 
the modified lysosomal enzyme. 
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40. The transgenic plant or plant cell of claim 39, in which undetectable marker peptide 
comprises SEQ ID NO: 10. 

41. The transgenic plant or plant cell of claim 38, in which the modified lysosomal enzyme 
5 comprises: 

(a) an enzymarically-active fragment of an a-N-acetylgaiactosaminidase, acid lipase, a- 
galactosidase, glucocerebrosidase, a-L-iduronidase, iduronate sulfatase, a-maimosidase or 
sialidase; 

10 . 

(b) the a-N-acetylgalactosaniinidase, acid lipase, a-galactosidase, glucocerebrosidase, a-L- 
iduronidase, iduronate sulfatase, a-mannosidase, sialidase or (a) having one or more amino 
acid residues added to the amino or carboxyl terminus of the a-N-acetylgalactosaminidase, 
acid lipase, a-galactosidase, glucocerebrosidase, a-L-iduronidase, iduronate sulfatase, a- 

1 5 mannosidase, sialidase or (a); or 

(c) the a-N-acetylgalactosaminidase, acid lipase, a-galactosidase, glucocerebrosidase, a-L- 
iduronidase, iduronate sulfatase, a-mannosidase, sialidase or (a) having one or more 
naturally-occurring amino acid additions, deletions or substitutions. 

20 

42. The transgenic plant or plant cell of claim 41, in which the modified lysosomal enzyme 
comprises: 

(a) an enzymatically-active fragment of a human glucocerebrosidase or human a-L- 
25 iduronidase enzyme; 

(b) the human glucocerebrosidase, human a-L-iduronidase or (a) having one or more amino 
acid residues added to the amino or carboxyl terminus of the human glucocerebrosidase, 
human a-L-iduronidase or (a); or 

30 

(c) the human glucocerebrosidase, human a-L-iduronidase or (a) having one or more 
naturally-occurring amino acid additions, deletions or substitutions. 

43. The transgenic plant or plant cell of claim 38, in which the modified lysosomal enzyme is 
35 a fusion protein comprising: 

(I) (a) the enzymatically-active fragment of the human or animal lysosomal enzyme, 
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(b) the human or animal lysosomal enzyme, or 

(c) the human or animal lysosomal enzyme or (a) having one or more naturally-occurring 
5 amino acid additions, deletions or substitutions, and 

(II) a cleavable linker fused to the amino or carboxyl terminus of (I). 

44. The transgenic plant or plant cell of claim 34, in which the transgenic plant or plant cell is 
10 , a transgenic tobacco plant or tobacco cell. 

45. The transgenic plant or plant cell of claim 34, in which the lysosomal enzyme is a human 
or animal lysosomal enzyme. 

15 46. The transgenic plant or plant cell of claim 45, in which the lysosomal enzyme is an a-N- 
acetylgalactosaminidase, acid lipase, o>galactosidase, glucocerebrosidase, ct-L-iduronidase, 
iduronate sulfatase, a-mannosidase or sialidase. 

47. The transgenic plant or plant cell of claim 46, in which the lysosomal enzyme is a human 
20 glucocerebrosidase or human oc-L-iduronidase. 

48. A leaf, stem, root, flower or seed of the transgenic plant of claim 34, 35, 38, 39, 42, 44, 
45, or 47. 

25 49. A seed of plant line Nicotiana sp., which seed has the ATCC Accession No. , 

deposited July 25, 2000. 

50. A plant grown from the seed of claim 49. 

30 51. A lysosomal enzyme which is enzymatically active and is produced according to a process 
comprising: 

recovering the lysosomal enzyme from (i) a transgenic plant cell or (ii) a ceil, tissue or organ 
of a transgenic plant which transgenic plant cell or plant is transformed or transfected with a 
3 5 recombinant expression construct comprising a nucleotide sequence encoding the lysosomal 
enzyme and a promoter that regulates expression of the nucleotide sequence so that the 
lysosomal enzyme is expressed by the transgenic plant cell or plant. 
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52. The lysosomal enzyme of claim 59, in which the promoter is an inducible promoter. 

53. The lysosomal enzyme of claim 52, which process is carried out with the transgenic plant 
5 and additionally comprises a step of inducing the inducible promoter before or after the 

transgenic plant is harvested, which inducing step is carried out before recovering the 
lysosomal enzyme from the cell, tissue or organ of the transgenic plant. 

54. The lysosomal enzyme of claim 5 1 , which is a modified lysosomal enzyme which is 
10 , enzymatically active and comprises: 

(a) an enzymatically-active fragment of a human or animal lysosomal enzyme; 

(b) the human or animal lysosomal enzyme or (a) having one or more amino acid residues 

1 5 added to the amino or carboxyl terminus of the human or animal lysosomal enzyme or (a); or 

(c) the human or animal lysosomal enzyme or (a) having one or more naturally-occurring 
amino acid, additions, deletions or substitutions. 

20 55 . The lysosomal enzyme of claim 54, in which the modified lysosomal enzyme comprises a 
signal peptide or detectable marker peptide at the amino or carboxyl terminal of the modified 
lysosomal enzyme. 

56. The lysosomal enzyme of claim 54, in which the modified lysosomal enzyme comprises: 

25 

(a) an enzymatically-active fragment of an a-N-acetylgalactosaminidase, acid lipase, a- 
galactosidase, glucocerebrosidase, a-L-iduronidase, iduronate sulfatase, a-mannosidase or 
sialidase; 

30 (b) the a-N-acetylgalactosaminidase, acid lipase, a-galactosidase, glucocerebrosidase, a-L- 
iduronidase, iduronate sulfatase, a-mannosidase, sialidase or (a) having one or more amino 
acid residues added to the amino or carboxyl terminus of the a-N-acetylgalactosaminidase, 
acid lipase, a-galactosidase, glucocerebrosidase, a-L-iduronidase, iduronate sulfatase, a- 
mannosidase, sialidase or (a); or 

35 

(c) the a-N-acetylgalactosaminidase, acid lipase, a-galactosidase, glucocerebrosidase, a-L- 
iduronidase, iduronate sulfazase, a-mannosidase, sialidase or (a) having one or more 
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naturally-occurring amino acid additions, deletions or substitutions. 

57. The lysosomal enzyme of claim 56, in which the modified lysosomal enzyme comprises: 

5 (a) an enzymatically-active fragment of a human glucocerebrosidase or human cc-L- 
iduronidase enzyme; 

(b) the human glucocerebrosidase, human a-L-iduronidase or (a) having one or more amino 
acid residues added to the amino or carboxyl terminus of the human glucocerebrosidase, 

10 » human a-L-iduronidase or (a); or 

(c) the human glucocerebrosidase, human a-L-iduronidase or (a) having one or more 
naturally-occurring amino acid additions, deletions or substitutions. 

15 58. The lysosomal enzyme of claim 54, in which the modified lysosomal enzyme is a fusion 
protein comprising: 

(I) (a) the enzymatically-active fragment of the human or animal lysosomal enzyme, 

20 (b) the human or animal lysosomal enzyme, or 

(c) the human or animal lysosomal enzyme or (a) having one or more naturally-occurring 
amino acid additions, deletions or substitutions, and 

25 (II) a cleavable linker fused to the amino or carboxyl terminus of (I). 

59. The lysosomal enzyme of claim 51, in which the transgenic plant is a transgenic tobacco 
plant. 

30 60. The lysosomal enzyme of claim 5 1 , in which the lysosomal enzyme is a human or animal 
lysosomal enzyme. 

61 . The lysosomal enzyme of claim 60, in which the lysosomal enzyme is an a-N- 
acetylgalactosaminidase, acid lipase, a-galactosidase, glucocerebrosidase, a-L-iduronidase, 

35 iduronate sulfatase, a-mannosidase or sialidase. 

62. The lysosomal enzyme of claim 61, in which the lysosomal enzyme is a human 
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glucocerebrosidase or human a-L-iduronidase. * 

63. The lysosomal enzyme of claim 51, in which the organ is leaf, stem, root, flower, fruit or 
seed. 

5 
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10 

The present invention is not to be limited to scope by the biological material deposited 
since the deposited embodiments are intended as illustrations of the individual aspects of the 
invention, and any biological material, or constructs which are functionally equivalent are 
within the scope of this invention. Indeed, various modifications of the invention in addition 
15 to those shown and described herein will become apparent to those skilled in the art from the 
foregoing description and accompanying drawings. Such modifications are intended to fall 
within the scope of the appended claims. 

Various references are cited herein; these are incorporated by reference in their entirety. 
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We Claim: 

1 . An isolated glucocererbrosidase, or a fragment thereof. 

2. The isolated glucocererbrosidase, or a fragment thereof, according to Claim 1, 
5 wherein said isolated glucocererbrosidase, or a fragment thereof, is separated from the 

cellular constituents in which the glucocererbrosidase, or fragments thereof, are normally 
associated with in nature. 

3. The isolated glucocererbrosidase, or a fragment thereof, according to Claim 1, 
10 wherein said isolated glucocererbrosidase, or a fragment thereof, is purified. 

4. An isolated alpha-galactosidase, or a fragment thereof. 

5. The isolated isolated alpha-galactosidase, or a fragment thereof, according to 
15 Claim 1, wherein said isolated alpha-galactosidase, or a fragment thereof, is separated 

from the cellular constituents in which the alpha-galactosidase, or fragments thereof, are 
normally associated with in nature. 

6. The isolated alpha-galactosidase, or a fragment thereof, according to Claim 1, 
20 wherein said isolated alpha-galactosidase, or a fragment thereof, is purified. 

7. A purified composition comprising the amino acid sequence of 
glucocererbrosidase. 

25 8. A purified composition comprising the amino acid sequence of 
alpha-galactosidase. 

9. A method for producing glucocererbrosidase, comprising the steps of: 

a) growing a plant, wherein said plant comprises a recombinant expression 
30 construct comprising a nucleotide sequence encoding a glucocererbrosidase and a 

promoter that regulates the expression of the nucleotide sequence in said plant; and 

b) isolating glucocererbrosidase from said plant. 



76 



WO 02/08404 



PCT/US01/24111 



10. The method according to Claim 9, wherein said isolating comprises the steps of: 

i) breaking open said plant; 

ii) isolating the interstitial fluid of said plant; and 

iii) isolating said glucocererbrosidase from said interstitial fluid. 

5 

11. A method for producing alpha-galactosidase, comprising the steps of: 

a) growing a plant, wherein said plant comprises a recombinant expression 
construct comprising a nucleotide sequence encoding a alpha-galactosidase and a 
promoter that regulates the expression of the nucleotide sequence in said plant; and 
10 b) isolating alpha-galactosidase from said plant. 



12. The method according to Claim 11, wherein said isolating comprises the steps of: 

i) breaking open said plant; 

ii) isolating the interstitial fluid of said plant; and 

1 5 iii) isolating said alpha-galactosidase from said interstitial fluid. 

13. A isolated alpha-galactosidase, or a fragment thereof having increased specific 
activity. 

20 14. An RNA viral vector containing a nucleic acid sequence encoding an enzyme 
having increased lysozomal enzyme activity, comprising: 

a) an RNA virus; and 

b) a nucleic acid sequence encoding an enzyme having increased lysozomal 
enzyme activity, the nucleic acid sequence inserted into the RNA virus, such that the 

25 enzyme having increased lysozomal enzyme activity encoded by the gene is produced in 
a plant cell that is part of a whole plant, or a cell in culture. 

15. The RNA viral vector of claim 14 wherein the nucleic acid encodes 
alpha-galactosidase activity. 

30 

16. The RNA viral vector of claim 14 wherein the nucleic acid encodes 
glucocerebrosidase activity. 
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17. A isolated alpha-galactosidase, or a fragment thereof having a post-translational 
modification pattern provided by a host plant. 

18. A isolated alpha-galactosidase, or a fragment thereof having a post-translational 
5 modification pattern provided by a host, wherein the alpha-galactosidase has increased 

specific activity when compared to alpha-galactosidase made by an animal cell. 

19. A isolated alpha-galactosidase, or a fragment thereof having a post-translational 
modification pattern provided by a host plant, wherein the post-translational modification 

10 pattern is a glycosylation pattern. 
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FIGURE 12 



GTATTTTTACAACAATTACCAACAACAACAAACAACAAACAACATTACAATT 
ACTATTTACAATTACAATGGCATACACA 

CAGACAGCTACCACATCAGCTTTGCTGGACACTGTCCGAGGAAACAACTCCT 
TGGTCAATGATCTAGCAAAGCGTCGTCT 

TTACGACACAGCGGTTGAAGAGTTTAACGCTCGTGACCGCAGGCCCAAGGTG 
AACTTTTCAAAAGTAATAAGCGAGGAGC 

AGACGCTTATTGCTACCCGGGCGTATCCAGAATTCCAAATTACATTTTATAAC 
ACGCAAAATGCCGTGCATTCGCTTGCA 

GGTGGATTGCGATCTTTAGAACTGGAATATCTGATGATGCAAATTCCCTACGG 
ATCATTGACTTATGACATAGGCGGGAA 

TTTTGCATCGCATCTGTTCAAGGGACGAGCATATGTACACTGCTGTATGCCCA 
ACCTGGACGTTCGAGACATCATGCGGC 

ACGAAGGCCAGAAAGACAGTATTGAACTATACCTTTCTAGGCTAGAGAGAGG 
GGGGAAAACAGTCCCCAACTTCCAAAAG 

GAAGCATTTGACAGATACGCAGAAATTCCTGAAGACGCTGTCTGTCACAATA 
CTTTCCAGACAATGCGACATCAGCCGAT 

GCAGCAATCAGGCAGAGTGTATGCCATTGCGCTACACAGCATATATGACATA 
CCAGCCGATGAGTTCGGGGCGGCACTCT 

TGAGGAAAAATGTCCATACGTGCTATGCCGCTTTCCACTTCTCTGAGAACCTG 
CTTCTTGAAGATTCATACGTCAATTTG 

GACGAAATCAACGCGTGTTTTTCGCGCGATGGAGACAAGTTGACCTTTTCTTT 
TGCATCAGAGAGTACTCTTAATTATTG 

TCATAGTTATTCTAATATTCTTAAGTATGTGTGCAAAACITACTTCCCGGCCTC 
TAATAGAGAGGTTTACATGAAGGAGT 

TTTTAGTCACCAGAGTTAATACCTGGTTTTGTAAGTTTTCTAGAATAGATACT 
TTTCTTTTGTACAAAGGTGTGGCCCAT 

AAAAGTGTAGATAGTGAGCAGTTTTATACTGCAATGGAAGACGCATGGCATT 
ACAAAAAGACTCTTGCAATGTGCAACAG 

CGAGAGAATCCTCCTTGAGGATTCATCATCAGTCAATTACTGGTTTCCCAAAA 
TGAGGGATATGGTCATCGTACCATTAT 

TCGACATTTCTTTGGAGACTAGTAAGAGGACGCGCAAGGAAGTCTTAGTGTC 
CAAGGATTTCGTGTTTACAGTGCTTAAC 

CACATTCGAACATACCAGGCGAAAGCTCTTACATACGCAAATGTTTTGTCCTT 
TGTCGAATCGATTCGATCGAGGGTAAT 

CATTAACGGTGTGACAGCGAGGTCCGAATGGGATGTGGACAAATCTTTGTTA 
CAATCCTTGTCCATGACGTTTTACCTGC 

ATACTAAGCTTGCCGTTCTAAAGGATGACTTACTGATTAGCAAGTTTAGTCTC 
GGTTCGAAAACGGTGTGCCAGCATGTG 

TGGGATGAGATTTCGCTGGCGTTTGGGAACGCATTTCCCTCCGTGAAAGAGA 
GGCTCTTGAACAGGAAACTTATCAGAGT . 

GGCAGGCGACGCATTAGAGATCAGGGTGCCTGATCTATATGTGACCTTCCAC 
GACAGATTAGTGACTGAGTACAAGGCCT 
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CTGTGGACATGCCTGCGCTTGACATTAGGAAGAAGATGGAAGAAACGGAAGT 
GATGTACAATGCACTTTCAGAGTTATCG 

GTGTTAAGGGAGTCTGACAAATTCGATGTTGATGTTTTTTCCCAGATGTGCCA 
ATCTTTGGAAGTTGACCCAATGACGGC 

AGCGAAGGTTATAGTCGCGGTCATGAGCAATGAGAGCGGTCTGACTCTCACA 
TTTGAACGACCTACTGAGGCGAATGTTG 

CGCTAGCTTTACAGGATCAAGAGAAGGCTTCAGAAGGTGCTTTGGTAGTTAC 
CTCAAGAGAAGTTGAAGAACCGTCCATG 

AAGGGTTCGATGGCCAGAGGAGAGTTACAATTAGCTGGTCTTGCTGGAGATC 
ATCCGGAGTCGTCCTATTCTAAGAACGA 

GGAGATAGAGTCTTTAGAGCAGTTTCATATGGCAACGGCAGATTCGTTAATT 
CGTAAGCAGATGAGCTCGATTGTGTACA 

CGGGTCCGATTAAAGTTCAGCAAATGAAAAACTTTATCGATAGCCTGGTAGC 
ATCACTATCTGCTGCGGTGTCGAATCTC 

GTCAAGATCCTCAAAGATACAGCTGCTATTGACCTTGAAACCCGTCAAAAGT 
TTGGAGTCTTGGATGTTGCATCTAGGAA 

GTGGTTAATCAAACCAACGGCCAAGAGTCATGCATGGGGTGTTGTTGAAACC 
CACGCGAGGAAGTATCATGTGGCGCTTT 

TGGAATATGATGAGCAGGGTGTGGTGACATGCGATGATTGGAGAAGAGTAGC 
TGTCAGCTCTGAGTCTGTTGTTTATTCC 

GACATGGCGAAACTCAGAACTCTGCGCAGACTGCTTCGAAACGGAGAACCGC 
ATGTCAGTAGCGCAAAGGTTGTTCTTGT 

GGACGGAGTTCCGGGCTGTGGGAAAACCAAAGAAATTCTTTCCAGGGTTAAT 
TTTGATGAAGATCTAATTTTAGTACCTG 

GGAAGCAAGCCGCGGAAATGATCAGAAGACGTGCGAATTCCTCAGGGATTAT 
TGTGGCCACGAAGGACAACGTTAAAACC 

GTTGATTCTTTCATGATGAATTTTGGGAAAAGCACACGCTGTCAGTTCAAGAG 
GTTATTCATTGATGAAGGGTTGATGTT 

GCATACTGGTTGTGTTAATTTTCTTGTGGCGATGTCATTGTGCGAAATTGCAT 
ATGTTTACGGAGACACACAGCAGATTC 

CATACATCAATAGAGTTTCAGGATTCCCGTACCCCGCCCATTTTGCCAAATTG 
GAAGTTGACGAGGTGGAGACACGCAGA 

ACTACTCTCCGTTGTCCAGCCGATGTCACACATTATCTGAACAGGAGATATGA 
GGGCTTTGTCATGAGCACTTCTTCGGT 

TAAAAAGTCTGTTTCGCAGGAGATGGTCGGCGGAGCCGCCGTGATCAATCCG 
ATCTCAAAACCCTTGCATGGCAAGATCC 

TGACTTTTACCCAATCGGATAAAGAAGCTCTGCTTTCAAGAGGGTATTCAGAT 
GTTCACACTGTGCATGAAGTGCAAGGC 

GAGACATACTCTGATGTTTCACTAGTTAGGTTAACCCCTACACCAGTCTCCAT 
CATTGCAGGAGACAGCCCACATGTTTT 

GGTCGCATTGTCAAGGCACACCTGTTCGCTCAAGTACTACACTGTTGTTATGG 
ATCCTTTAGTTAGTATCATTAGAGATC 

TAGAGAAACTTAGCTCGTACTTGTTAGATATGTATAAGGTCGATGCAGGAAC 
ACAATAGCAATTACAGATTGACTCGGTG 

TTCAAAGGTTCCAATCTTTTTGTTGCAGCGCCAAAGACTGGTGATATTTCTGA 
TATGCAGTTTTACTATGATAAGTGTCT 
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CCCAGGCAACAGCACCATGATGAATAATTTTGATGCTGTTACCATGAGGTTG 
ACTGACATTTCATTGAATGTCAAAGATT 

GCATATTGGATATGTCTAAGTCTGTTGCTGCGCCTAAGGATCAAATCAAACCA 
CTAATACCTATGGTACGAACGGCGGCA 

GAAATGCCACGCCAGACTGGACTATTGGAAAATTTAGTGGCGATGATTAAAA 
GGAACTTTAACGCACCCGAGTTGTCTGG 

CATCATTGATATTGAAAATACTGCATCTTTAGTTGTAGATAAGTTTTTTGATA 
GTTATTTGCTTAAAGAAAAAAGAAAAC 

CAAATAAAAATGTTTCTTTGTTCAGTAGAGAGTCTCTCAATAGATGGTTAGAA 
AAGCAGGAACAGGTAACAATAGGCCAG 

CTCGCAGATTTTGATTTTGTAGATTTGCCAGCAGTTGATCAGTACAGACACAT 
GATTAAAGCACAACCCAAGCAAAAATT 

GGACACTTCAATCCAAACGGAGTACCCGGCTTTGCAGACGATTGTGTACCAT 
TCAAAAAAGATCAATGCAATATTTGGCC 

CGTTGTTTAGTGAGCTTACTAGGCAATTACTGGACAGTGTTGATTCGAGCAGA 
TTTTTGTTTTTCACAAGAAAGACACCA 

GCGCAGATTGAGGATTTCTTCGGAGATCTCGACAGTCATGTGCCGATGGATG 
TCTTGGAGCTGGATATATCAAAATACGA 

CAAATCTCAGAATGAATTCCACTGTGCAGTAGAATACGAGATCTGGCGAAGA 
TTGGGTTTTGAAGACTTCTTGGGAGAAG 

TTTGGAAACAAGGGCATAGAAAGACCACCCTCAAGGATTATACCGCAGGTAT 
AAAAACTTGCATCTGGTATCAAAGAAAG 

AGCGGGGACGTCACGACGTTCATTGGAAACACTGTGATCATTGCTGCATGTTT 
GGCCTCGATGCTTCCGATGGAGAAAAT 

AATCAAAGGAGCCTTTTGCGGTGACGATAGTCTGCTGTACTTTCCAAAGGGTT 
GTGAGTTTCCGGATGTGCAACACTCCG 

CGAATCTTATGTGGAATTTTGAAGCAAAACTGTTTAAAAAACAGTATGGATA 
CTTTTGCGGAAGATATGTAATACATCAC 

GACAGAGGATGCATTGTGTATTACGATCCCCTAAAGTTGATCTCGAAACTTG 
GTGCTAAACACATCAAGGATTGGGAACA 

CTTGGAGGAGTTCAGAAGGTCTCTTTGTGATGTTGCTGTTTCGTTGAACAATT 
GTGCGTATTACACACAGTTGGACGACG 

CTGTATGGGAGGTTCATAAGACCGCCCCTCCAGGTTCGTTTGTTTATAAAAGT 
CTGGTGAAGTATTTGTCTGATAAAGTT 

CTTTTTAGAAGTTTGTTTATAGATGGCTCTAGTTGTTAAAGGAAAAGTGAATA 
TCAATGAGTTTATCGACCTGACAAAAA 

TGGAGAAGATCTTACCGTCGATGTTTACCCCTGTAAAGAGTGTTATGTGTTCC 
AAAGTTGATAAAATAATGGTTCATGAG 

AATGAGTCATTGTCAGAGGTGAACCTTCTTAAAGGAGTTAAGCTTATTGATA 
GTGGATACGTCTGTTTAGCCGGTTTGGT 

CGTCACGGGCGAGTGGAACTTGCCTGACAATTGCAGAGGAGGTGTGAGCGTG 
TGTCTGGTGGACAAAAGGATGGAAAGAG 

CCGACGAGGCCACTCTCGGATCTTACTACACAGCAGCTGCAAAGAAAAGATT 
TCAGTTCAAGGTCGTTCCCAATTATGCT 

ATAACCACCCAGGACGCGATGAAAAACGTCTGGCAAGTTTTAGTTAATATTA 
GAAATGTGAAGATGTCAGCGGGTTTCTG 
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TCCGCTTTCTCTGGAGTTTGTGTCGGTGTGTATTGTTTATAGAAATAATATAA 
AATTAGGTTTGAGAGAGAAGATTACAA 

ACGTGAGAGACGGAGGGCCCATGGAACTTACAGAAGAAGTCGTTGATGAGTT 
CATGGAAGATGTCCCTATGTCGATCAGG 

CTTGCAAAGTTTCGATCTCGAACCGGAAAAAAGAGTGATGTCCGCAAAGGGA 
AAAATAGTAGTAATGATCGGTCAGTGCC 

GAACAAGAACTATAGAAATGTTAAGGATTTTGGAGGAATGAGTTTTAAAAAG 
AATAATTTAATCGATGATGATTCGGAGG 

CTACTGTCGCCGAATCGGATTCGTTTTAAATAGATCTTACAGTATCACTACTC 
CATCTCAGTTCGTGTTCTTGTCATTAA 

TATGCAGGTGCTGAACACCATGGTGAACAAACACTTCTTGTCCCTTTCGGTCC 
TCATCGTCCTCCTTGGCCTCTCCTCCA 

ACTTGACAGCCGGCATGCTGGACAATGGATTGGCAAGGACGCCTACCATGGG 
GTGGCTGCACTGGGAGCGCTTCATGTGC 

AACCTTGACTGCCAGGAAGAGCCAGATTCCTGCATCAGTGAGAAGCTCTTCA 
TGGAGATGGCAGAGCTCATGGTCTCAGA 

AGGCTGGAAGGATGCAGGTTATGAGTACCTCTGCATTGATGACTGTTGGATG 
GCTCCCCAAAGAGATTCAGAAGGCAGAC 

TTCAGGCAGACCCTCAGCGCTTTCCTCATGGGATTCGCCAGCTAGCTAATTAT 
GTTCACAGCAAAGGACTGAAGCTAGGG 

ATTTATGCAGATGTTGGAAATAAAACCTGCGCAGGCTTCCCTGGGAGTTTTGG 
ATACTACGACATTGATGCCCAGACCTT 

TGCTGACTGGGGAGTAGATCTGCTAAAATTTGATGGTTGTTACTGTGACAGTT 
TGGAAAATTTGGCAGATGGTTATAAGC 

ACATGTCCTTGGCCCTGAATAGGACTGGCAGAAGCATTGTGTACTCCTGTGA 
GTGGCCTCTTTATATGTGGCCCTTTCAA 

AAGCCCAATTATACAGAAATCCGACAGTACTGCAATCACTGGCGAAATTTTG 
CTGACATTGATGATTCCTGGAAAAGTAT 

AAAGAGTATCTTGGACTGGACATCTTTTAACCAGGAGAGAATTGTTGATGTT 
GCTGGACCAGGGGGTTGGAATGACCCAG 

ATATGTTAGTGATTGGCAACTTTGGCCTCAGCTGGAATCAGCAAGTAACTCA 
GATGGCCCTCTGGGCTATCATGGCTGCT 

CCTTTATTCATGTCTAATGACCTCCGACACATCAGCCCTCAAGCCAAAGCTCT 
CCTTCAGGATAAGGACGTAATTGCCAT 

CAATCAGGACCCCTTGGGCAAGCAAGGGTACCAGCTTAGACAGGGAGACAA 
CTTTGAAGTGTGGGAACGACCTCTCTCAG 

GCTTAGCCTGGGCTGTAGCTATGATAAACCGGCAGGAGATTGGTGGACCTCG 
CTCTTATACCATCGCAGTTGCTTCCCTG 

GGTAAAGGAGTGGCCTGTAATCCTGCCTGCTTCATCACACAGCTCCTCCCTGT 
GAAAAGGAAGCTAGGGTTCTATGAATG 

GACTTCAAGGTTAAGAAGTCACATAAATCCCACAGGCACTGTTTTGCTTCAGC 
TAtctgaaaaggacgaattatgaCCTA 

GGCTCGCAAAGTTTCGAACCAAATCCTCAAAAAGAGGTCCGAAAAATAATAA 
TAATTTAGGTAAGGGGCGTTCAGGCGGA 

AGGCCTAAACCAAAAAGTTTTGATGAAGTTGAAAAAGAGTTTGATAATTTGA 
TTGAAGATGAAGCCGAGACGTCGGTCGC 
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GGATTCTGATTCGTATTAAATATGTCTTACTCAATCACTTCT 
GTGTTTTTGTCATCTGTATGGGCTGA 

CCCTATAGAATTGTTAAACGTTTGTACAAATTCGTTAGGTAACCAGTTTCAAA 
CACAGCAAGCAAGAACTACTGTTCAAC 

AGCAGTTCAGCGAGGTGTGGAAACCTTTCCCTCAGAGCACCGTCAGATTTCCT 
GGCGATGTTTATAAGGTGTACAGGTAC 

AATGCAGTTTTAGATCCTCTAATTACTGCGTTGCTGGGGGCTTTTGATACTAG 
GAATAGAATAATCGAAGTAGAAAACCA 

GCAGAGTCCGACAACAGCTGAAACGTTAGATGCTACCCGCAGGGTAGACGA 
CGCTACGGTTGCAATTCGGTCTGCTATAA 

ATAATTTAGTTAATGAACTAGTAAGAGGTACTGGACTGTACAATCAGAATAC 
TTTTGAAAGTATGTCTGGGTTGGTCTGG 

ACCTCTGCACCTGCATCTTAAATGCATAGGTGCTGAAATATAAAGTTTGTGTT 
TCTAAAACACACGTGGTACGTACGATA 

ACGTACAGTGTTTTTCCCTCCACTTAAATCGAAGGGTAGTGTCTTGGAGCGCG 
CGGAGTAAACATATATGGTTCATATAT 

GTCCGTAGGCACGTAAAAAAAGCGAGGGATTCGAATTCCCCCGGAACCCCCG 
GTTGGGGCCCAGGTACCAATTCTTGAAG 

ACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAA 
TGGTTTCTTAGACGTCAGGTGGCACTT 

TTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCA 
AATATGTATCCGCTCATGAGACAATAA 

CCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAAC 
ATTTCCGTGTCGCCCTTATTCCCTTTTT 

TGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAA 
AAGATGCTGAAGATCAGTTGGGTGCAC 

GAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTT 
TCGCCCCGAAGAACGTTTTCCAATGATG 

AGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTGTTGACGCCGG 
GCAAGAGCAACTCGGTCGCCGCATACA 

CTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTA 
CGGATGGCATGACAGTAAGAGAATTAT 

GCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGAC 
AACGATCGGAGGACCGAAGGAGCTAACC 

GCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACC 
GGAGCTGAATGAAGCCATACCAAACGA 

CGAGCGTGACACCACGATGCCTGCAGCAATGGCAACAACGTTGCGCAAACTA 
TTAACTGGCGAACTACTTACTCTAGCTT 

CCGGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACT 
TCTGCGCTCGGCCCTTCCGGCTGGCTGG 

TTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGC 
AGCACTGGGGCCAGATGGTAAGCCCTC 

CCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGA 
AATAGACAGATCGCTGAGATAGGTGCCT 

CACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAG 
ATTGATTTAAAACTTCATTTTTAATTT 
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AAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTA 
ACGTGAGTTTTCGTTCCACTGAGCGTC 

AGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCG 
TAATCTGCTGCTTGCAAACAAAAAAAC 

CACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTT 
CCGAAGGTAACTGGCTTCAGCAGAGCG 

CAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAA 
GAACTCTGTAGCACCGCCTACATACCT 

CGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTC 
TTACCGGGTTGGACTCAAGACGATAGT 

TACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGC 
CCAGCTTGGAGCGAACGACCTACACCGAA 

CTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGA 
GAAAGGCGGACAGGTATCCGGTAAGCGG 

CAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTG 
GTATCTTTATAGTCCTGTCGGGTTTCGCC 

ACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTA 
TGGAAAAACGCCAGCAACGCGGCCTTT 

TTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTA 
TCCCCTGATTCTGTGGATAACCGTAT 

TACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGC 
AGCGAGTCAGTGAGCGAGGAAGCGGAAG 

AGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGC 
ATATGGTGCACTCTCAGTACAATCTGC 

TCTGATGCCGCATAGTTAAGCCAGTATACACTCCGCTATCGCTACGTGACTGG 
GTCATGGCTGCGCCCCGACACCCGCCA 

ACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAG 
ACAAGCTGTGACCGTCTCCGGGAGCTG 

CATGTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGAGGCAGCTGCGG 
TAAAGCTCATCAGCGTGGTCGTGAAGCG 

ATTCACAGATGTCTGCCTGTTCATCCGCGTCCAGCTCGTTGAGTTTCTCCAGA 
AGCGTTAATGTCTGGCTTCTGATAAAG 

CGGGCCATGTTAAGGGCGGTTTTTTCCTGTTTGGTCACTTGATGCCTCCGTGT 
AAGGGGGAATTTCTGTTCATGGGGGTA 

ATGATACCGATGAAACGAGAGAGGATGCTCACGATACGGGTTACTGATGATG 
AACATGCCCGGTTACTGGAACGTTGTGA 

GGGTAAACAACTGGCGGTATGGATGCGGCGGGACCAGAGAAAAATCACTCA 
GGGTCAATGCCAGCGCTTCGTTAATACAG 

ATGTAGGTGTTCCACAGGGTAGCCAGCAGCATCCTGCGATGCAGATCCGGAA 
CATAATGGTGCAGGGCGCTGACTTCCGC 

GTTTCCAGACTTTACGAAACACGGAAACCGAAGACCATTCATGTTGTTGCTCA 
GGTCGCAGACGTTTTGCAGCAGCAGTC 

GCTTCACGTTCGCTCGCGTATCGGTGATTCATTCTGCTAACCAGTAAGGCAAC 
CCCGCCAGCCTAGCCGGGTCCTCAACG 

ACAGGAGCACGATCATGCGCACCCGTGGCCAGGACCCAACGCTGCCCGAGAT 
GCGCCGCGTGCGGCTGCTGGAGATGGCG 
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GACGCGATGGATATGTTCTGCCAAGGGTTGGTTTGCGCATTCACAGTTCTCCG 
CAAGAATTGATTGGCTCCAATTCTTGG 

AGTGGTGAATCCGTTAGCGAGGTGCCGCCGGCTTCCATTCAGGTCGAGGTGG 
CCCGGCTCCATGCACCGCGACGCAACGC 

GGGGAGGCAGACAAGGTATAGGGCGGCGCCTACAATCCATGCCAACCCGTTC 
CATGTGCTCGCCGAGGCGGCATAAATCG 

CCGTGACGATCAGCGGTCCAGTGATCGAAGTTAGGCTGGTAAGAGCCGCGAG 
CGATCCTTGAAGCTGTCCCTGATGGTCG 

TCATCTACCTGCCTGGACAGCATGGCCTGCAACGCGGGCATCCCGATGCCGC 
CGGAAGCGAGAAGAATCATAATGGGGAA 

GGCCATCCAGCCTCGCGTCGCGAACGCCAGCAAGACGTAGCCCAGCGCGTCG 
GCCGCCATGCCGGCGATAATGGCCTGCT 

TCTCGCCGAAACGTTTGGTGGCGGGACCAGTGACGAAGGCTTGAGCGAGGGC 
GTGCAAGATTCCGAATACCGCAAGCGAC 

AGGCCGATCATCGTCGCGCTCCAGCGAAAGCGGTCCTCGCCGAAAATGACCC 
AGAGCGCTGCCGGCACCTGTCCTACGAG 

TTGCATGATAAAGAAGACAGTCATAAGTGCGGCGACGATAGTCATGCCCCGC 

GCCCACCGGAAGGAGCTGACTGGGTTGA 

AGGCTCTCAAGGGCATCGGTCGAGATTTAGGTGACACTATA 
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FIGURE 13 



GTATTTTTACAACAATTACCAACAACAACAAACAACAGACAACATTACAATT 
ACTATTTACAATTACAATGGCATACACA 

CAGACAGCTACCACATCAGCTTTGCTGGACACTGTCCGAGGAAACAACTCCT 
TGGTCAATGATCTAGCAAAGCGTCGTCT 

TTACGACACAGCGGTTGAAGAGTTTAACGCTCGTGACCGCAGGCCCAAGGTG 
AACTTTTCAAAAGTAATAAGCGAGGAGC 

AGACGCTTATTGCTACCCGGGCGTATCCAGAATTCCAAATTACATTTTATAAC 
ACGCAAAATGCCGTGCATTCGCTTGCA 

GGTGGATTGCGATCTTTAGAACTGGAATATCTGATGATGCAAATTCCCTACGG 
ATCATTGACTTATGACATAGGCGGGAA 

TTTTGCATCGCATCTGTTCAAGGGACGAGCATATGTACACTGCTGCATGCCCA 
ACCTGGACGTTCGAGACATCATGCGGC 

ACGAAGGCCAGAAAGACAGTATTGAACTATACCTTTCTAGGCTAGAGAGAGG 
GGGGAAAACAGTCCCCAACTTCCAAAAG 

GAAGCATTTGACAGATACGCAGAAATTCCTGAAGACGCTGTCTGTCACAATA 
CTTTCCAGACATGCGAACATCAGCCGAT 

GCAGCAATCAGGCAGAGTGTATGCCATTGCGCTACACAGCATATATGACATA 
CCAGCCGATGAGTTCGGGGCGGCACTCT 

TGAGGAAAAATGTCCATACGTGCTATGCCGCTTTCCACTTCTCCGAGAACCTG 
CTTCTTGAAGATTCATGCGTCAATTTG 

GACGAAATCAACGCGTGTTTTTCGCGCGATGGAGACAAGTTGACCTTTTCTTT 
TGCATCAGAGAGTACTCTTAATTACTG 

TCATAGTTATTCTAATATTCTTAAGTATGTGTGCAAAACTTACTTCCCGGCCTC 
TAATAGAGAGGTTTACATGAAGGAGT 

TTTTAGTCACCAGAGTTAATACCTGGTTTTGTAAGTTTTCTAGAATAGATACT 
TTTCTTTTGTACAAAGGTGTGGCCCAT 

AAAAGTGTAGATAGTGAGCAGTTTTATACTGCAATGGAAGACGCATGGCATT 
ACAAAAAGACTCTTGCAATGTGCAACAG 

CGAGAGAATCCTCCTTGGGGATTCATCATCAGTCAATTACTGGTTTCCCAAAA 
TGAGGGATATGGTCATCGTACCATTAT 

TCGACATTTCTTTGGAGACTAGTAAGAGGACGCGCAAGGAAGTCTTAGTGTC 
CAAGGATTTCGTGTTCACAGTGCTTAAC 

CACATTCGAACATACCAGGCGAAAGCTCTTACATACGCAAATGTTTTGTCCTT 
CGTCGAATCGATTCGATCGAGGGTAAT 

CATTAACGGTGTGACAGCGAGGTCCGAATGGGATGTGGACAAATCTTTGTTA 
CAATCCTTGTCCATGACGTTTTACCTGC 

ATACTAAGCTTGCCGTTCTAAAGGATGACTTACTGATTAGCAAGTTTAGTCTC 
GGTTCGAAAACGGTGTGCCAGCATGTG 

TGGGATGAGATTTCGCTGGCGTTTGGGAACGCATTTCCCTCCGTGAAAGAGA 
GGCTCTTGAACAGGAAACTTATCAGAGT 

GGCAGGCGACGCATTAGAGATCAGGGTGCCTGATCTATATGTGACCTTCCAC 
GACAGATTAGTGACTGAGTACAAGGCCT 
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CTGTGGACATGCCTGCGCTTGACATTAGGAAGAAGATGGAAGAAACGGAAGT 
GATGTACAATGCACTTTCAGAATTATCG 

GTGTTAAGGGAGTCTGACAAATTCGATGTTGATGTTTTTTCCCAGATGTGCCA 
ATCTTTGGAAGTTGACCCAATGACGGC 

AGCGAAGGTTATAGTCGCGGTCATGAGCAATGAGAGCGGTCTGACTCTCACA 
TTTGAACGACCTACTGAGGCGAATGTTG 

CGCTAGCTTTACAGGATCAAGAGAAGGCTTCAGAAGGTGCATTGGTAGTTAC 
CTCAAGAGAAGTTGAAGAACCGTCCATG 

AAGGGTTCGATGGCCAGAGGAGAGTTACAATTAGCTGGTCTTGCTGGAGATC 
. ATCCGGAATCGTCCTATTCTAAGAACGA 
GGAGATAGAGTCTTTAGAGCAGTTTCATATGGCGACGGCAGATTCGTTAATT 
CGTAAGCAGATGAGCTCGATTGTGTACA 

CGGGTCCGATTAAAGTTCAGCAAATGAAAAACTTTATCGATAGCCTGGTAGC 
ATCACTATCTGCTGCGGTGTCGAATCTC 

GTCAAGATCCTCAAAGATACAGCTGCTATTGACCTTGAAACCCGTCAAAAGT 
TTGGAGTCTTGGATGTTGCATCTAGGAA 

GTGGTTAATCAAACCAACGGCCAAGAGTCATGCATGGGGTGTTGTTGAAACC 
CACGCGAGGGAGTATCATGTGGCGCTTT 

TGGAATATGATGAGCAGGGTGTGGTGACATGCGATGATTGGAGAAGAGTAGC 
TGTTAGCTCTGAGTCTGTTGTTTATTCC 

GACATGGCGAAACTCAGAACTCTGCGCAGACTGCTTCGAAACGGAGAACCGC 
ATGTCAGTAGCGCAAAGGTTGTTCTTGT 

GGACGGAGTTCCGGGCTGTGGAAAAACCAAAGAAATTCTTTCCAGGGTTAAT 
TTTGATGAAGATCTAATTTTAGTACCTG 

GGAAGCAAGCCGCGGAAATGATCAGAAGACGTGCGAATTCCTCAGGGATTAT 
TGTGGCCACGAAGGACAACGTTAAAACC 

GTTGATTCTTTCATGATGAATTTTGGGAAAAGCACACGCTGTCAGTTCAAGAG 
GTTATTCATTGATGAAGGGTTGATGTT 

GCATACTGGTTGTGTTAATTTTCTTGTGGCGATGTCATTGTGCGAAATTGCAT 
ATGTTTACGGAGACACACAGCAGATTC 

CATACATCAATAGAGTTTCAGGATTCCCGTACCCCGCCCATTTTGCCAAATTG 
GAAGTTGACGAGGTGGAGACACGCAGA 

ACTACTCTCCGTTGTCCAGCCGATGTCACACATTATCTGAACAGGAGATATGA 
GGGCTTTGTCATGAGCACTTCTTCGGT 

TAAAAAGTCTGTTTCGCAGGAGATGGTCGGCGGAGCCGCCGTGATCAATCCG 
ATCTCAAAACCCTTGCATGGCAAGATCC 

TGACTTTTACCCAATCGGATAAAGAAGCTCTGCTTTCAAGAGGGTATTCAGAT 
GTTCACACTGTGCATGAAGTGCAAGGC 

GAGACATACTCTGATGTTTCACTAGTTAGGTTAACCCCTACACCGGTCTCCAT 
CATTGCAGGAGACAGCCCACATGTTTT 

GGTCGCATTGTCAAGGCACACCTGTTCGCTCAAGTACTACACTGTTGTTATGG 
ATCCTTTAGTTAGTATCATTAGAGATC 

TAGAGAAACTTAGCTCGTACTTGTTAGATATGTATAAGGTCGATGCAGGAAC 
ACAATAGCAATTACAGATTGACTCGGTG 

TTCAAAGGTTCCAATCTTTTTGTTGCAGCGCCAAAGACTGGTGATATTTCTGA 
TATGCAGTTTTACTATGATAAGTGTCT 
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CCCAGGCAACAGCACCATGATGAATAATTTTGATGCTGTTACCATGAGGTTG 
ACTGACATTTCATTGAATGTCAAAGATT 

GCATATTGGATATGTCTAAGTCTGTTGCTGCACCTAAGGATCAAATCAAACCA 
CTAATACCTATGGTACGAACGGCGGCA 

GAAATGCCACGCCAGACTGGACTATTGGAAAATTTAGTGGCGATGATTAAAA 
GAAACTTTAACGCACCCGAGTTGTCTGG 

CATCATTGATATTGAAAATACTGCATCTTTGGTTGTAGATAAGTTTTTTGATA 
GTTATTTGCTTAAAGAAAAAAGAAAAC 

CAAATAAAAATGTTTCTTTGTTCAGTAGAGAGTCTCTCAATAGATGGTTAGAA 
AAGCAGGAACAGGTAACAATAGGCCAG 

CTCGCAGATTTTGATTTTGTGGATTTGCCAGCAGTTGATCAGTACAGACACAT 
GATTAAAGCACAACCCAAACAAAAGTT 

GGACACTTCAATCCAAACGGAGTACCCGGCTTTGCAGACGATTGTGTACCAT 
TCAAAAAAGATCAATGCAATATTCGGCC 

CGTTGTTTAGTGAGCTTACTAGGCAATTACTGGACAGTGTTGATTCGAGCAGA 
TTTTTGTTTTTCACAAGAAAGACACCA 

GCGCAGATTGAGGATTTCTTCGGAGATCTCGACAGTCATGTGCCGATGGATG 
TCTTGGAGCTGGATATATCAAAATACGA 

CAAATCTCAGAATGAATTCCACTGTGCAGTAGAATACGAGATCTGGCGAAGA 
TTGGGTTTCGAAGACTTCTTGGGAGAAG 

TTTGGAAACAAGGGCATAGAAAGACCACCCTCAAGGATTATACCGCAGGTAT 
AAAAACTTGCATCTGGTATCAAAGAAAG 

AGCGGGGACGTCACGACGTTCATTGGAAACACTGTGATCATTGCTGCATGTTT 
GGCCTCGATGCTTCCGATGGAGAAAAT 

AATCAAAGGAGCCTTTTGCGGTGACGATAGTCTGCTGTACTTTCCAAAGGGTT 
GTGAGTTTCCGGATGTGCAACACTCCG 

CGAATCTTATGTGGAATTTTGAAGCAAAACTGTTTAAAAAACAGTATGGATA 
CTTTTGCGGAAGATATGTAATACATCAC 

GACAGAGGATGCATTGTGTATTACGATCCCCTAAAGTTGATCTCGAAACTTG 
GTGCTAAACACATCAAGGATTGGGAACA 

CTTGGAGGAGTTCAGAAGGTCTCTTTGTGATGTTGCTGTTTCGTTGAACAATT 
GTGCGTATTACACACAGTTGGACGACG 

CTGTATGGGAGGTTCATAAGACCGCCCCTCCAGGTTCGTTTGTTTATAAAAGT 
CTGGTGAAGTATTTGTCTGATAAAGTT 

CTTTTTAGAAGTTTGTTTATAGATGGCTCTAGTTGTTAAAGGAAAAGTGAATA 
TCAATGAGTTTATCGACCTGACAAAAA 

TGGAGAAGATCTTACCGTCGATGTTTACCCCTGTAAAGAGTGTTATGTGTTCC 
AAAGTTGATAAAATAATGGTTCATGAG 

AATGAGTCATTGTCAGGGGTGAACCTTCTTAAAGGAGTTAAGCTTATTGATA 
GTGGATACGTCTGTTTAGCCGGTTTGGT 

CGTCACGGGCGAGTGGAACTTGCCTGACAATTGCAGAGGAGGTGTGAGCGTG 
TGTCTGGTGGACAAAAGGATGGAAAGAG 

CCGACGAGGCCATTCTCGGATCTTACTACACAGCAGCTGCAAAGAAAAGATT 
TCAGTTCAAGGTCGTTCCCAATTATGCT 

ATAACCACCCAGGACGCGATGAGAAACGTCTGGCAAGTTTTAGTTAATATTA 
GAAATGTGAAGATGTCAGCGGGTTTCTG 
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TCCGCTTTCTCTGGAGTTTGTGTCGGTGTGTATTGTTTATAGAAATAATATAA 
AATTAGGTTTGAGAGAGAAGATTACAA 

ACGTGAGAGACGGAGGGCCCATGGAACTTACAGAAGAAGTCGTTGATGAGTT 
CATGGAAGATGTCCCTATGTCGATCAGG 

CTTGCAAAGTTTCGATCTCGAACCGGAAAAAAGAGTGATGTCCGCAAAGGGA 
AAAATAGTAGTAGTGATCGGTCAGTGCC 

GAACAAGAACTATAGAAATGTTAAGGATTTTGGAGGAATGAGTTTTAAAAAG 
AATAATTTAATCGATGATGATTCGGAGG 

CTACTGTCGCCGAATCGGATTCGTTTTAAATAGATCTTACAGTATCACTACTC 
CATCTCAGTTCGTGTTCTTGTCAttaa 

ttaaaatgcagctgaggaacccagaactacatctgggctgcgcgcttgcgcttcgcttcctggccctcgtttcctgggac 

atccctggggctagagcactggacaatggattggcaaggacgcctaccatgggctggctgcactgggagcgcttcatgtg 

caaccttgactgccaggaagagccagattcctgcatcagtgagaagctcttcatggagatggcagagctcatggtctcag 

aaggctggaaggatgcaggttatgagtacctctgcattgatgactgttggatggctccccaaagagattcagaaggcaga 

cttcaggcagaccctcagcgctttcctcatgggattcgccagctagctaattatgttcacagcaaaggactgaagctagg 

gatttatgcagatgttggaaataaaacctgcgcaggcttccctgggagttttggatactacgacpttgatgcccagacct 

ttgctgactggggagtagatctgctaaaatttgatggttgttactgtgacagtttggaaaatttggcagatggttataag 

cacatgtccttggccctgaataggactggcagaagcattgtgtactcctgtgagtggcctctttatatgtggccctttca 

aaagcccaattatacagaaatccgacagtactgcaatcactggcgaaattttgctgacattgatgattcctggaaaagta 

taaagagtatcttggactggacatcttttaaccaggagagaattgttgatgttgctggaccagggggttggaatgaccca 

gatatgttagtgattggcaactttggcctcagctggaatcagcaagtaactcagatggccctctgggctatcatggctgc 

tcctttattcatgtctaatgacctccgacacatcagccctcaagccaaagctctccttcaggataaggacgtaattgcca 

tcaatcaggaccccttgggcaagcaagggtaccagcttagacagggagacaactttgaagtgtgggaacgacctctctca 

ggcttagcctgggctgtagctatgataaaccggcaggagattggtggacctcgctcttataccatcgcagttgcttccct 

gggtaaaggagtggcctgtaatcctgcctgcttcatcacacagctcctccctgtgaaaaggaagctagggttctatgaat 

ggacttcaaggttaagaagtcacataaatcccacaggcactgttttgcttcagctatctgaaaaggacgaattatgacct 

aggGGGTAGTCAAGATGCATAATAAATAACGGATTGTGTCCGTAATCACACGT 

GGTGCGTACGATAACGCATAGTGTTTT 

TCCCTCCACTTAAATCGAAGGGTTGTGTCTTGGATCGCGCGGGTCAAATGTAT 
ATGGTTCATATACATCCGCAGGCACGT 

AATAAAGCGAGGGGTTCGGGTCGAGGTCGGCTGTGAAACTCGAAAAGGTTCC 
GGAAAACAAAAAAGAGAGTGGTAGGTAA 

TAGTGTTAATAATAAGAAAATAAATAATAGTGGTAAGAAAGGTTTGAAAGTT 
GAGGAAATTGAGGATAATGTAAGTGATG 

ACGAGTCTATCGCGTCATCGAGTACGTTTTAATCAATATGCCTTATACAATCA 
ACTCTCCGAGCCAATTTGTTTACTTAA 

GTTCCGCTTATGCAGATCCTGTGCAGCTGATCAATCTGTGTACAAATGCATTG 
GGTAACCAGTTTCAAACGCAACAAGCT 

AGGACAACAGTCCAACAGCAATTTGCGGATGCCTGGAAACCTGTGCCTAGTA 
TGACAGTGAGATTTCCTGCATCGGATTT 

CTATGTGTATAGATATAATTCGACGCTTGATCCGTTGATCACGGCGTTATTAA 
ATAGCTTCGATACTAGAAATAGAATAA 

TAGAGGTTGATAATCAACCCGCACCGAATACTACTGAAATCGTTAACGCGAC 
TCAGAGGGTAGACGATGCGACTGTAGCT 

ATAAGGGCTTCAATCAATAATTTGGCTAATGAACtGGTTCGTGGAACTGGCaT 
GTTCAATCAAGCAAGCTTTGAGACTGC 
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gtcactgaagactta 

aaattcagggtggctgataccaaaatcagcagtggttgttcgtccacttaaatataacgattgtcatatctggatccaac 
agttaaaccatgtgatggtgtatactgtggtatggcgtaaaacaacggaaaagtcgctgaagacttaaaattcagggtgg 
ctgataccaaaatcagcagtggttgttcgtccacttaaaaataacgattgtcatatctggatccaacagttaaaccatgt 
gatggtgtatactgtggtatggcgtaaaacaacggagaggttcgaatcctcccctaaccgcgggtagcggccca 
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